NEW HCV ISOLATES

Technical Field 

The present invention relates to new isolates
of the viral class Hepatitis C, polypeptides,
polynucleotides and antibodies derived therefrom, as well
as the use of such polypeptides, polynucleotides and
antibodies in assays (e.g., immunoassays, nucleic acid
hybridization assays, etc.) and in the production of
viral polypeptides.

Background 

Non-A, Non-B hepatitis (NANBH) is a
transmissible disease or family of diseases that are
believed to be viral-induced, and that are
distinguishable from other forms of viral-associated
liver diseases, including that caused by the known
hepatitis viruses, i.e., hepatitis A virus (HAV),
hepatitis B virus (HBV), and delta hepatitis virus (HDV),
as well as the hepatitis induced by cytomegalovirus (CMV)
or Epstein-Barr virus (EBV). NANBH was first identified
in transfused individuals. Transmission from man to
chimpanzee and serial passage in chimpanzees provided
evidence that NANBH is due to a transmissible infectious
agent or agents. Epidemiologic evidence suggests
that there may be three types of NANBH: the water-borne
epidemic type; the blood or needle associated type; and
the sporadically occurring (community acquired) type.
However, until recently, no transmissible agent
responsible for NANBH had been identified.

Clinical diagnosis and identification of NANBH
has been accomplished primarily by exclusion of other
viral markers. Among the methods used to detect putative
NANBH antigens and antibodies are agar-gel diffusion,
counterimmunoelectrophoresis, immunofluorescence
microscopy, immune electron microscopy, radioimmunoassay,
and enzyme-linked immunosorbent assay. However, none of
these assays has proved to be sufficiently sensitive,
specific, and reproducible to be used as a diagnostic
test for NANBH.

Until recently there has been neither clarity
nor agreement as to the identity or specificity of the
antigen-antibody systems associated with agents of NANBH.
It is possible that NANBH is caused by more than one
infectious agent, and it is unclear what the serological
assays detect in the serum of patients with NANBH.

In the past, a number of candidate NANBH agents
were postulated. See, e.g., Prince (1983) Ann. Rev.
Microbiol. 27:217; Feinstone & Hoofnagle (1984) New Eng.
J. Med. 311:185; Overby (1985) Curr. Hepatol. 5:49; Overby
(1986) Curr. Hepatol. 1:65; Overby (1987) Curr. Hepatol.
2:35; and Iwarson (1987) British Med. J. 295:946.
However, there is no proof that any of these candidates
represents the etiological agent of NANBH.

In 1987, Houghton et al. cloned the first virus
definitively linked to NANBH. See, e.g., EPO Pub. No.
318,216; Houghton et al., Science 244:359 (1989).
Houghton et al. described therein the cloning of an
isolate from a new viral class, hepatitis C virus (HCV),
the prototype isolate described therein being named
"HCV1". HCV is a Flavi-like virus, with an RNA genome.
Houghton et al. described the production of recombinant
proteins from HCV sequences that are useful as diagnostic
reagents, as well as polynucleotides useful in diagnostic
hybridization assays and in the cloning of additional HCV
isolates.

The demand for sensitive, specific methods for
screening and identifying carriers of NANBH and
NANBH-contaminated blood or blood products is significant.
Post-transfusion hepatitis (PTH) occurs in approximately
10% of transfused patients, and NANBH accounts for up to
90% of these cases. There is a frequent progression to
chronic liver damage (25-55%).

Patient care as well as the prevention of
transmission of NANBH by blood and blood products or by
close personal contact require reliable diagnostic and
prognostic tools to detect nucleic acids, antigens and
antibodies related to NANBH. In addition, there is also
a need for effective vaccines and immunotherapeutic
agents for the prevention and/or treatment of
the disease.

While at least one HCV isolate has been
identified which is useful in meeting the above needs,
additional isolates, particularly those with a divergent
genome, may prove to have unique applications.


Summary of the Invention 

New isolates of HCV have been characterized from
Japanese blood donors who have been implicated as NANBH
carriers. These isolates exhibit nucleotide and amino
acid sequence heterogeneity with respect to the prototype
isolate, HCV1, in several viral domains. It is believed
that these distinct sequences are of importance,
particularly in diagnostic assays and in vaccine
development.

In one embodiment, the present invention
provides a DNA molecule comprising a nucleotide sequence
of at least 15 bp from an HCV isolate substantially
homologous to an isolate selected from the group J1 or
J7, wherein said nucleotide sequence is distinct from the
nucleotide sequence of HCV isolate HCV1.

In another embodiment, the present invention
provides a DNA molecule comprising a nucleotide sequence
of at least 15 bp encoding an amino acid sequence from an
HCV isolate J1 or J7 wherein the J1 or J7 amino acid
sequence is distinct from the amino acid sequence of HCV
isolate HCV1.

Yet another embodiment of the present invention
provides a purified polypeptide comprising an amino acid
sequence from an HCV isolate substantially homologous to
an isolate selected from the group J1 and J7, wherein
said amino acid sequence is distinct from the sequence of
the polypeptides encoded by the HCV isolate HCV1.

Still another embodiment of the present
invention provides a polypeptide comprising an amino acid
sequence from an HCV isolate J1 or J7 wherein the J1 or J7
amino acid sequence is distinct from the amino acid
sequence of HCV isolate HCV1 and the polypeptide is
immobilized on a solid support.

In a further embodiment of the present
invention, an immunoassay for detecting the presence of
anti-HCV antibodies in a test sample is provided
comprising: (a) incubating the test sample under
conditions that allow the formation of antigen-antibody
complexes with an immunogenic polypeptide comprising an
amino acid sequence from an HCV isolate substantially
homologous to an isolate selected from the group J1 and
J7, wherein the amino acid sequence is distinct from the
amino acid sequence of HCV isolate HCV1; and (b)
detecting an antigen-antibody complex comprising the
immunogenic polypeptide.

The present invention also provides a
composition comprising anti-HCV antibodies that bind an
HCV epitope substantially free of antibodies that do not
bind an HCV epitope, wherein: (a) the HCV epitope
comprises an amino acid sequence from an HCV isolate
substantially homologous to an isolate selected from the
group J1 and J7, wherein the amino acid sequence is
distinct from the amino acid sequence of HCV isolate
HCV1; and (b) the J1 or J7 amino acid sequence is not
immunologically cross-reactive with HCV1.

A further embodiment of the present invention
provides an immunoassay for detecting the presence of an
HCV polypeptide in a test sample comprising: (a)
incubating the test sample under conditions that allow
the formation of antigen-antibody complexes with anti-HCV
antibodies that bind an HCV epitope wherein: (i) the
HCV epitope comprises an amino acid sequence from an HCV
isolate J1 or J7; (ii) the J1 or J7 amino acid sequence
is distinct from the amino acid sequence of HCV isolate
HCV1; and (iii) the J1 or J7 amino acid sequence is not
immunologically cross-reactive with HCV1; and (b)
detecting an antigen-antibody complex comprising the
anti-HCV antibodies.

Also provided by the present invention is a
method of producing anti-HCV antibodies comprising
administering to a mammal a polypeptide comprising an
amino acid sequence from an HCV isolate J1 or J7 wherein
the J1 or J7 amino acid sequence is distinct from the
amino acid sequence of HCV isolate HCV1 whereby the
mammal produces anti-HCV antibodies.

Yet another embodiment of the present invention
provides a method of detecting HCV polynucleotides in a
test sample comprising: (a) providing a probe comprising
the DNA molecule of claim 1; (b) contacting the test
sample and the probe under conditions that allow for the
formation of a polynucleotide duplex between the probe
and its complement in the absence of substantial
polynucleotide duplex formation between the probe and
non-HCV polynucleotide sequences present in the test
sample; and (c) detecting any polynucleotide duplexes
comprising the probe.

A still further embodiment of the present
invention provides a method of producing a recombinant
polypeptide comprising an HCV amino acid sequence, the
method comprising: (a) providing host cells transformed
by a DNA construct comprising control sequences for the
host cell operably linked to a coding sequence encoding
an amino acid sequence from an HCV isolate J1 or J7
wherein the J1 or J7 amino acid sequence is distinct from
the amino acid sequence of HCV isolate HCV1; (b) growing
the host cells under conditions whereby the coding
sequence is transcribed and translated into the recombinant
polypeptide; and (c) recovering the recombinant
polypeptide.

These and other embodiments of the present
invention will be readily apparent to those of ordinary
skill in the art in view of the following description.

Brief Description of the Figures

Figure 1 shows the consensus sequence of the
coding strand of a fragment from the J7 C/E domain with
the heterogeneities.

Figure 2 shows the consensus sequence of the
coding strand of a fragment from the J1 E domain with the
heterogeneities.

Figure 3 shows the consensus sequence of the
coding strand of a fragment of the J1 E/NS1 domain with
the heterogeneities.

Figure 4 shows the consensus sequence of the
coding strand of a fragment from the J1 NS3 domain with
the heterogeneities.

Figure 5 shows the consensus sequence of the
coding strand of a fragment from the J1 NS5 domain with
the heterogeneities.

Figure 6 shows the homology of the J7 C/E
consensus sequence with the nucleotide sequence of the
same domain from HCV1.

Figure 7 shows the homology of the J1 E
consensus sequence with the nucleotide sequence of the
same domain from HCV1.

Figure 8 shows the homology of the J1 E/NS1
consensus sequence with the nucleotide sequence of the
same domain from HCV1.

Figure 9 shows the homology of the J1 NS3
consensus sequence with the nucleotide sequence of the
same domain from HCV1.

Figure 10 shows the homology of the J1 NS5
consensus sequence with the nucleotide sequence of the
same domain from HCV1.

Figure 11 shows the putative genomic
organization of the HCV1 genome.

Figure 12 shows the nucleotide sequence of the
ORF of HCV1. In the figure nucleotide number 1 is the
first A of the putative initiating methionine of the
large ORF; nucleotides upstream of this nucleotide are
numbered with negative numbers.

Figure 13 shows the consensus sequence of the
coding strand of a fragment from the J1 NS1 domain (J1
1519) with the nucleotide sequence of the same domain
from HCV1. Also shown are the amino acids encoded
therein.

Figure 14 shows a composite of the consensus
sequence from the core to the NS1 domain of J1 with the
nucleotide sequence of the same domain from HCV1. Also
shown are the amino acids encoded therein.

Figure 15 shows a consensus sequence of the
coding strand of the NS1 domain of J1, as determined in
Example IV. Also shown are the nucleotide sequence of
the same domain from HCV1, and the amino acids encoded in
the HCV1 and J1 sequences.

Figure 16 shows a consensus sequence of a
coding strand of the C200 region of the NS3-NS4 domain of
J1. Also shown are the nucleotide sequence of the same
domain from HCV1, and the amino acids encoded
in the sequences.

Figure 17 shows a consensus sequence of the
coding strand of the NS1 domain of J1, as determined in
Example V. Also shown are the nucleotide sequence of the
same domain from HCV1, and the amino acids encoded in the
sequences.

Figure 18 shows a consensus sequence of the
coding strand of the untranslated and core domains of J1.
Also shown are the nucleotide sequence of the same domain
from HCV1, and the amino acids encoded in the sequences.

Detailed Description of the Invention

The practice of the present invention will
employ, unless otherwise indicated, conventional
techniques of molecular biology, microbiology,
recombinant DNA techniques, and immunology, which are
within the skill of the art. Such techniques are
explained fully in the literature. See, e.g., Maniatis,
Fritsch & Sambrook, MOLECULAR CLONING: A LABORATORY MANUAL
(1982); DNA CLONING, VOLUMES I AND II (D.N. Glover ed.
1985); OLIGONUCLEOTIDE SYNTHESIS (M.J. Gait ed. 1984);
NUCLEIC ACID HYBRIDIZATION (B.D. Hames & S.J. Higgins
eds. 1984); TRANSCRIPTION AND TRANSLATION (B.D. Hames &
S.J. Higgins eds. 1984); ANIMAL CELL CULTURE (R.I.
Freshney ed. 1986); IMMOBILIZED CELLS AND ENZYMES (IRL
Press, 1986); B. Perbal, A PRACTICAL GUIDE TO MOLECULAR
CLONING (1984); the series, METHODS IN ENZYMOLOGY
(Academic Press, Inc.); GENE TRANSFER VECTORS FOR
MAMMALIAN CELLS (J.H. Miller and M.P. Calos eds. 1987,
Cold Spring Harbor Laboratory); Methods in Enzymology
Vol. 154 and Vol. 155 (Wu and Grossman, and Wu, eds.,
respectively); Mayer and Walker, eds. (1987),
IMMUNOCHEMICAL METHODS IN CELL AND MOLECULAR BIOLOGY
(Academic Press, London); Scopes (1987), PROTEIN
PURIFICATION: PRINCIPLES AND PRACTICE, Second Edition
(Springer-Verlag, N.Y.); and HANDBOOK OF EXPERIMENTAL
IMMUNOLOGY, VOLUMES I-IV (D.M. Weir and C.C. Blackwell
eds. 1986). All patents, patent applications, and other
publications mentioned herein, both supra and infra, are
hereby incorporated herein by reference.

The term "hepatitis C virus" has been reserved
by workers in the field for an heretofore unknown
etiologic agent of NANBH. Accordingly, as used herein,
"hepatitis C virus" (HCV) refers to an agent causative of
NANBH, which was formerly referred to as NANBV and/or
BB-NANBV, from the class of the prototype isolate HCV1
described by Houghton et al. See, e.g., [...]
May 1989 (available in non-U.S. applications claiming
priority therefrom), the disclosures of which are
incorporated herein by reference. The nucleotide
sequence and putative amino acid sequence of HCV1 is
[...]. The terms HCV, NANBV, and BB-NANBV
are used interchangeably herein. As an extension of this
terminology, the disease caused by HCV, formerly called
NANB hepatitis (NANBH), is called hepatitis C. The terms
NANBH and hepatitis C may be used interchangeably
herein. The term "HCV", as used herein, denotes a viral
species of which pathogenic strains cause NANBH, as well
as attenuated strains or defective interfering particles
derived therefrom.



HCV is a Flavi-like virus. The morphology and
composition of Flavivirus particles are known, and are
discussed by Brinton (1986) THE VIRUSES: THE TOGAVIRIDAE
AND FLAVIVIRIDAE (Series eds. Fraenkel-Conrat and Wagner,
vol. eds. Schlesinger and Schlesinger, Plenum Press),
p. 327-374. Generally, with respect to morphology,
Flaviviruses contain a central nucleocapsid surrounded by
a lipid bilayer. Virions are spherical and have a
diameter of about 40-50 nm. Their cores are about 25-30
nm in diameter. Along the outer surface of the virion
envelope are projections that are about 5-10 nm long with
terminal knobs about 2 nm in diameter.

The HCV genome is comprised of RNA. It is
known that RNA containing viruses have relatively high
rates of spontaneous mutation, i.e., reportedly on the
order of 10⁻³ to 10⁻⁴ per incorporated nucleotide.
Therefore, there are multiple strains,
which may be virulent or avirulent, within the HCV class
or species.

It is believed that the genome of HCV isolates
is comprised of a single ORF of approximately 9,000
nucleotides to approximately 12,000 nucleotides, encoding
a polyprotein similar in size to that of HCV1, an encoded
polyprotein of similar hydrophobic and antigenic
character to that of HCV1, and the presence of co-linear
peptide sequences that are conserved with HCV1. In
addition, the genome is believed to be a
positive-stranded RNA.
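
By way of illustration only, the mutation rate and ORF size quoted in this section can be combined into a rough estimate of the number of nucleotide changes expected per copy of the genome. The Python sketch below is not part of the disclosure: the function name `expected_mutations` is hypothetical, and the calculation assumes that the quoted rate applies uniformly across an ORF of 9,000 to 12,000 nucleotides.

```python
# Illustrative arithmetic only. The rates and lengths are the approximate
# figures quoted in the text, not measured values, and a uniform
# per-nucleotide rate is an assumption of this sketch.

def expected_mutations(genome_length_nt: int, rate_per_nt: float) -> float:
    """Expected number of misincorporations in one copy of the genome."""
    if genome_length_nt < 0 or rate_per_nt < 0:
        raise ValueError("length and rate must be non-negative")
    return genome_length_nt * rate_per_nt


if __name__ == "__main__":
    for length in (9_000, 12_000):
        for rate in (1e-4, 1e-3):
            changes = expected_mutations(length, rate)
            print(f"{length} nt at {rate:g} per nt: about {changes:.1f} changes per copy")
```

Under these assumptions the estimate runs from about one change to about a dozen changes per copy, which is consistent with the expectation that multiple strains exist within the HCV class or species.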
Isolates of HCV comprise epitopes that are
immunologically cross-reactive with epitopes in the HCV1
genome. At least some of these are epitopes unique to
HCV when compared to other known Flaviviruses. The
uniqueness of the epitope may be determined by its
immunological reactivity with anti-HCV antibodies and
lack of immunological reactivity with antibodies to other
Flavivirus species. Methods for determining
immunological reactivity are known in the art, for
example, by radioimmunoassay, by ELISA assay, by
hemagglutination, and several examples of suitable
techniques for assays are provided herein.

It is also expected that the overall homology
of HCV isolates and HCV1 genomes at the nucleotide level
probably will be about 40% or greater, probably about 60%
or greater, and even more probably about 80% to about 90%
or greater. In addition, it is expected that there are many
corresponding contiguous sequences of at least about 13
nucleotides that are fully homologous. The correspondence
between the sequence from a new isolate and the HCV1
sequence can be determined by techniques known in the art.
For example, they can be determined by a direct comparison of
the sequence information of the polynucleotide from the
new isolate and HCV1 sequences. Alternatively, homology
can be determined by hybridization of the polynucleotides
under conditions which form stable duplexes between
homologous regions (for example, those which would be
used prior to S1 digestion), followed by digestion with
single-stranded specific nuclease(s), followed by size
determination of the digested fragments.
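
The direct comparison of sequence information described above can be sketched in a few lines of Python. This is an illustrative sketch, not the method used to prepare the figures: the function names `percent_identity` and `longest_identical_run` are hypothetical, the two sequences are assumed to be already aligned to the same length without gaps, and the example strings in the usage note are toy sequences rather than HCV data.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percent of matching positions between two pre-aligned sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be pre-aligned to the same length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    matches = sum(a == b for a, b in zip(seq_a.upper(), seq_b.upper()))
    return 100.0 * matches / len(seq_a)


def longest_identical_run(seq_a: str, seq_b: str) -> int:
    """Length of the longest stretch of consecutive matching positions."""
    best = current = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        current = current + 1 if a == b else 0
        best = max(best, current)
    return best


def has_fully_homologous_stretch(seq_a: str, seq_b: str, min_run: int = 13) -> bool:
    """True if the aligned sequences share an identical run of at least min_run."""
    return longest_identical_run(seq_a, seq_b) >= min_run
```

For the toy pair `"ACGTACGT"` and `"ACGTACGA"`, `percent_identity` returns 87.5 and `longest_identical_run` returns 7. The default `min_run` of 13 mirrors the contiguous stretch of at least about 13 nucleotides mentioned in the text.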

Because of the evolutionary relationship of the
strains or isolates of HCV, putative HCV strains or
isolates are identifiable by their homology at the
polypeptide level. Thus, new HCV isolates are expected
to be more than about 40% homologous, probably more than
[...] homologous, and even more than about [...]
homologous at the polypeptide level. [...]
For example, the amino acid sequence may be
determined directly and compared to the sequences
provided herein. Alternatively, the nucleotide sequence
of the genomic material of the putative HCV may be
determined, the amino acid sequence encoded therein
can be determined, and the corresponding regions compared.
The ORF of HCV1 is shown in Figure 12. The
non-structural, core, and envelope domains of the
polyprotein have been predicted for HCV1. The "C", or
core, polypeptide is believed to be encoded from
the 5' terminus to about nucleotide 345 of HCV1. The
putative "E", or envelope, domain of HCV1 is believed to
be encoded from about nucleotide 346 to about nucleotide
1050. Putative NS1 is thought to be encoded from about
nucleotide 1051 to about nucleotide 1953. For the
remaining domains, putative NS2 is thought to be encoded
from about nucleotide 1954 to about nucleotide 3018,
putative NS3 from about nucleotide 3019 to about
nucleotide 4950, putative NS4 from about nucleotide 4951
to about nucleotide 6297, and putative NS5 from about
nucleotide 6298 to the 3' terminus. The above boundaries
are approximations based on an analysis of the ORF. The
exact boundaries can be determined by those skilled in the
art in view of the disclosure herein.

"HCV/J1" or "J1" and "HCV/J7" or "J7" refer to
new HCV isolates characterized by the sequences
disclosed herein, as well as isolates that are
substantially homologous thereto; i.e., at least about
90% or about 95% at the nucleotide level. It is believed
that the sequences disclosed herein characterize an HCV
subclass that is predominant in Japan and other Asian
and/or Pacific rim countries. Additional J1 and J7
isolates can be obtained in view of the disclosure herein
and EPO Pub. No. 318,216. In particular, the J1 and J7
nucleotide sequences disclosed herein, as well as the
HCV1 sequences in Figure 12, can be used as primers or
probes to clone additional domains of J1, J7, or
additional isolates.
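
When HCV1-derived primers or probes are chosen to clone a particular domain, it can be convenient to map an HCV1 nucleotide position (numbered as in Figure 12) to the putative domain in which it falls. The Python sketch below is illustrative only. The name `putative_domain` is hypothetical, the start positions are the approximate HCV1 boundaries stated in this description, the start of E at nucleotide 346 is an assumption inferred from the core region ending at about nucleotide 345, and positions upstream of nucleotide 1 are simply rejected.

```python
from bisect import bisect_right

# (first nucleotide, putative domain). Each domain is taken to run up to the
# nucleotide before the next start; NS5 runs to the 3' end of the ORF.
# The value 346 for E is an assumption of this sketch (see lead-in).
DOMAIN_STARTS = [
    (1, "C"),
    (346, "E"),
    (1051, "NS1"),
    (1954, "NS2"),
    (3019, "NS3"),
    (4951, "NS4"),
    (6298, "NS5"),
]
_STARTS = [start for start, _ in DOMAIN_STARTS]


def putative_domain(position: int) -> str:
    """Return the putative HCV1 domain containing a Figure 12 position."""
    if position < 1:
        raise ValueError("positions upstream of nucleotide 1 are not in the ORF")
    # bisect_right counts how many domain starts are <= position.
    index = bisect_right(_STARTS, position) - 1
    return DOMAIN_STARTS[index][1]
```

Because the boundaries are approximations, a position close to a boundary should be treated as ambiguous rather than assigned with confidence.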

As used herein, a nucleotide sequence "from" a
designated sequence or source refers to a nucleotide
sequence that is homologous (i.e., identical) to or
complementary to the designated sequence or source, or a
portion thereof. The J1 sequences provided herein are a
minimum of about 6 nucleotides, preferably about 8
nucleotides, more preferably about 15 nucleotides, and
most preferably 20 nucleotides or longer. The maximum
length is the complete viral genome.
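
The definition above, under which a sequence is "from" a designated sequence if it is identical to, or complementary to, a portion of it, can be illustrated with a short Python check. This is a sketch under stated assumptions: the names `reverse_complement` and `is_from` are hypothetical, only exact matches are considered, "complementary" is interpreted as the reverse complement, and the default minimum length of 15 is just one of the preferred lengths mentioned in the text.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def _normalize(seq: str) -> str:
    """Upper-case the sequence and express RNA (U) as DNA (T)."""
    return seq.upper().replace("U", "T")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA or RNA sequence, as DNA."""
    return _normalize(seq).translate(_COMPLEMENT)[::-1]


def is_from(candidate: str, designated: str, min_length: int = 15) -> bool:
    """True if candidate is identical or complementary to part of designated."""
    cand = _normalize(candidate)
    ref = _normalize(designated)
    if len(cand) < min_length:
        return False
    return cand in ref or reverse_complement(cand) in ref
```

With the toy designated sequence `"ATGGCGTTAGCCTAGGATCCGTA"` (not an HCV sequence), the 15-mer `"GCGTTAGCCTAGGAT"` and its reverse complement both satisfy `is_from`, whereas the 5-mer `"GCGTT"` does so only when `min_length` is lowered to 5.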

In some aspects of the invention, the sequence
of the region from which the polynucleotide is derived is
preferably homologous to or complementary to a sequence
which is unique to an HCV genome or the J1 and J7 genome.
Whether or not a sequence is unique to a genome can be
determined by techniques known to those of skill in the
art. For example, the sequence can be compared to
sequences in databanks, e.g., GenBank, to determine
whether it is present in the uninfected host or other
organisms. The sequence can also be compared to the
known sequences of other viral agents, including those
which are known to induce hepatitis, e.g., HAV, HBV, and
HDV, and to other members of the Flaviviridae. The
correspondence or non-correspondence of the derived
sequence to other sequences can also be determined by
hybridization under the appropriate stringency conditions.
Hybridization techniques for determining the
complementarity of nucleic acid sequences are known in
the art. [...] Mismatches of duplex polynucleotides
formed by hybridization can be determined by known
techniques, including, for example, digestion with a
nuclease such as S1 that specifically digests
single-stranded areas in duplex polynucleotides. Regions
from which [...] include, for example, regions encoding
specific epitopes, as well as non-transcribed and/or
non-translated regions.

The J1 or J7 polynucleotide is not necessarily
physically derived from the nucleotide sequence shown,
but may be generated in any manner, including for
example, chemical synthesis or DNA replication or reverse
transcription or transcription. In addition,
combinations of regions corresponding to that of the
designated sequence may be modified in ways known in the
art to be consistent with an intended use. The
polynucleotides may also include one or more labels,
which are known to those of skill in the art.

An amino acid sequence "from" a designated
polypeptide or source of polypeptides means that the
amino acid sequence is homologous (i.e., identical) to
the sequence of the designated polypeptide, or a portion
thereof. An amino acid sequence "from" a designated
nucleic acid sequence refers to a polypeptide having an
amino acid sequence identical to that of a polypeptide
encoded in the sequence, or a portion thereof. The J1 or
J7 amino acid sequences in the polypeptides of the
present invention are at least about [...] amino acids
in length, preferably at least about [...] amino acids,
more preferably at least about 15 amino acids, and most
preferably at least about 30 amino acids.

The polypeptides of the present invention are
not necessarily translated from a designated nucleic acid
sequence; the polypeptides may be generated in any
manner, including for example, chemical synthesis, or
expression of a recombinant expression system, or
isolation from virus. The polypeptides may include one
or more analogs of amino acids or unnatural amino acids.
Methods of inserting analogs of amino acids into a
sequence are known in the art. The polypeptides may also
include one or more labels, which are known to those of
skill in the art.

The term "recombinant polynucleotide" as used
herein intends a polynucleotide of genomic, cDNA,
semisynthetic, or synthetic origin which, by virtue of
its origin or manipulation: (1) is linked to a
polynucleotide other than that to which it is linked in
nature, or (2) does not occur in nature.

The term "polynucleotide" as used herein refers
to a polymeric form of nucleotides of any length, either
ribonucleotides or deoxyribonucleotides. This term
refers only to the primary structure of the molecule.
Thus, this term includes double- and single-stranded DNA,
and RNA. It also includes known types of modifications,
for example, labels which are known in the art,
methylation, "caps", substitution of one or more of the
naturally occurring nucleotides with an analog,
internucleotide modifications such as, for example, those
with uncharged linkages (e.g., methyl phosphonates,
phosphotriesters, phosphoamidates, carbamates, etc.) and
with charged linkages (e.g., phosphorothioates,
phosphorodithioates, etc.), those containing pendant
moieties, such as, for example, proteins (including,
e.g., nucleases, toxins, antibodies, signal peptides,
poly-L-lysine, etc.), those with intercalators (e.g.,
acridine, psoralen, etc.), those containing chelators [...]

"Purified polynucleotide" refers to [...]
polynucleotide.

"Purified polypeptide" refers to a composition
comprising a specified polypeptide that is substantially
free of other components, such composition typically
comprising at least about [...] of the specified
polypeptide, more typically [...] of the specified
polypeptide.

"Recombinant host cells", "host cells", "cells",
"cell lines", "cell cultures", and other such terms
denote microorganisms or higher eukaryotic cell lines
[...] recipients for recombinant vector or other transfer
DNA [...]. It is understood that the progeny of a single
parental cell may not necessarily be completely identical
[...] in genomic or total DNA complement [...].

A "replicon" is any genetic element, e.g., a
plasmid, a chromosome, a virus, a cosmid, etc. that
behaves as an autonomous unit of polynucleotide
replication within a cell; i.e., capable of replication
under its own control.

A "cloning vector" is a replicon that can
transform a selected host cell and in which another
polynucleotide segment is attached, so as to bring about
the replication and/or expression of the attached
segment. Typically, cloning vectors include plasmids,
virus (e.g., bacteriophage vector) and cosmids.
An "integrating vector" is a vector that does
not behave as a replicon in a selected host cell, but has
the ability to integrate into a replicon (typically a
chromosome) resident in the selected host to stably
transform the host.

An "expression vector" is a construct that can 
transform a selected host cell and provides for 
expression of a heterologous coding sequence in the 
selected host. Expression vectors can be either a 
cloning vector or an integrating vector. 

A "coding sequence" is a polynucleotide
sequence which is transcribed into mRNA and/or translated
into a polypeptide when placed under the control of
appropriate regulatory sequences. The boundaries of the
coding sequence are determined by a translation start
codon at the 5'-terminus and a translation stop codon at
the 3'-terminus. A coding sequence can include, but is
not limited to, mRNA, cDNA, and recombinant polynucleotide
sequences.
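
The boundary rule in this definition, a translation start codon at the 5'-terminus and a translation stop codon at the 3'-terminus, can be illustrated with a short Python check. The sketch is not drawn from the disclosure: the name `is_bounded_coding_sequence` is hypothetical, and it assumes the standard genetic code with ATG as the start codon, TAA, TAG and TGA as stop codons, and no stop codon inside the reading frame.

```python
STOP_CODONS = frozenset({"TAA", "TAG", "TGA"})


def is_bounded_coding_sequence(seq: str, start_codon: str = "ATG") -> bool:
    """True if seq starts with a start codon, ends with a stop codon,
    is a whole number of codons, and has no internal stop codon."""
    dna = seq.upper().replace("U", "T")  # accept mRNA as well as cDNA
    if len(dna) < 6 or len(dna) % 3 != 0:
        return False
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] != start_codon or codons[-1] not in STOP_CODONS:
        return False
    # Every codon between the start and the terminal stop must be a sense codon.
    return not any(codon in STOP_CODONS for codon in codons[1:-1])
```

For example, the toy sequences `"ATGGCCTAA"` and `"AUGGCCUAG"` pass the check, while `"ATGTAAGCCTAA"` fails because of its internal stop codon. A fragment of a polyprotein ORF, such as the J1 and J7 fragments in the figures, would not be expected to pass, since it need not carry its own start and stop codons.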

"Control sequence" refers to polynucleotide 
regulatory sequences which are necessary to effect the 
expression of coding sequences to which they are ligated. 
The nature of such control sequences differs depending 
upon the host organism. In prokaryotes, control
sequences generally include promoter, ribosomal binding
site, and terminators. In eukaryotes, control
sequences generally include promoters, terminators and,
in some instances, enhancers. The term "control sequences"
is intended to include, at a minimum, all components the
presence of which is necessary for expression, and may
also include additional advantageous components.

"Operably linked" refers to a juxtaposition
wherein the components so described are in a relationship
permitting them to function in their intended manner. A
control sequence "operably linked" to a coding sequence
is ligated in such a way that expression of the coding
sequence is achieved under conditions compatible with the
control sequences.

An "open reading frame" or ORF is a region of a
polynucleotide sequence that encodes a polypeptide; this
region may represent a portion of a coding sequence or a
total coding sequence.

"Immunologically cross-reactive" refers to two
or more epitopes or polypeptides that are bound by the
same antibody. Cross-reactivity can be determined by any
of a number of immunoassay techniques, such as a
competition assay.

As used herein, the term "antibody" refers to a
polypeptide or group of polypeptides which comprise at
least one epitope. An "antigen binding site" is formed
from the folding of the variable domains of an antibody
molecule(s) to form three-dimensional binding sites with
an internal surface shape and charge distribution
complementary to the features of an epitope of an
antigen, which allows specific binding to form an
antibody-antigen complex. An antigen binding site may be
formed from a heavy- and/or light-chain domain (VH and
VL, respectively), which form hypervariable loops which
contribute to antigen binding. The term "antibody"
includes, without limitation, chimeric antibodies,
[...] antibodies, Fab proteins,
and single-domain antibodies. In many cases, the binding
phenomena of antibodies to antigens is equivalent to
other ligand/anti-ligand binding.

As used herein, a "single domain antibody"
(dAb) is an antibody which is comprised of a VH domain,
which binds specifically with a designated antigen. A
dAb does not contain a VL domain, but may contain other
antigen binding domains known to exist in antibodies, for
example, the kappa and lambda domains. Methods for
preparing dAbs are known in the art. See, for example,
Ward et al., Nature 341:544 (1989).

Antibodies may also be comprised of VH and VL
domains, as well as other known antigen binding domains.
Examples of these types of antibodies and methods for
their preparation are known in the art (see, e.g., U.S.
Patent No. 4,816,467, which is incorporated herein by
reference), and include the following. For example,
"vertebrate antibodies" refers to antibodies which are
tetramers or aggregates thereof, comprising light and
heavy chains which are usually aggregated in a "Y"
configuration and which may or may not have covalent
linkages between the chains. In vertebrate antibodies,
the amino acid sequences of the chains are homologous
with those sequences found in antibodies produced in
vertebrates, whether in situ or in vitro (for example, in
hybridomas). Vertebrate antibodies include, for example,
purified polyclonal antibodies and monoclonal antibodies,
methods for the preparation of which are described infra.

"Hybrid antibodies" are antibodies where chains
are separately homologous with reference to mammalian
antibody chains and represent novel assemblies of them,
so that two different antigens are precipitable by the
tetramer or aggregate. In hybrid antibodies, one pair of
heavy and light chains are homologous to those found in
an antibody raised against a first antigen, while a
second pair of chains are homologous to those found in an
antibody raised against a second antigen. This results
in the property of "divalence", i.e., the ability to bind
two antigens simultaneously. Such hybrids may also be
formed using chimeric chains, as set forth below.

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"Chimeric antibodies" refers to antibodies in 
which the heavy and/or light chains are fusion proteins. 
Typically, one portion of the amino acid sequences of the 
chain is homologous to corresponding sequences in an 
5 antibody derived from a particular species or a 

particular class, while the remaining segment of the 
chain is homologous to the sequences derived from another 
species and/ or class. Usually, the variable region of 
both light and heavy chains mimics the variable regions 

10 or antibodies derived from one species of vertebrates, 
while the constant portions are homologous to the 
sequences in the antibodies derived from another species 
of vertebrates. However, the definition is not limited 
to this particular example. Also included is any 

15 antibody in which either or both of the heavy or light 
chains are composed of combinations of sequences 
mimicking the sequences in antibodies of different 
sources, whether these sources be from differing classes 
or different species of origin, and whether or not the 

20 fusion point is at the variable/constant boundary. Thus, 
it is possible to produce antibodies in which neither the 
constant nor the variable region mimic know antibody 
sequences, it then becomes possible, for example, to 
construct antibodies whose variable region has a higher 

25 specific affinity for a particular antigen, or whose 

constant region can elicit enhanced complement fixation, 
or to make other improvements in properties possessed by 
a particular constant region. 

Another example is "altered antibodies", which
refers to antibodies in which the naturally occurring
amino acid sequence in a vertebrate antibody has been
varied. Utilizing recombinant DNA techniques, antibodies
can be redesigned to obtain desired characteristics. The
possible variations are many, and range from the changing
of one or more amino acids to the complete redesign of a
region, for example, the constant region. Changes in the
constant region are made, in general, to attain desired
cellular process characteristics, e.g., changes in
complement fixation, interaction with membranes, and
other effector functions. Changes in the variable region
may be made to alter antigen binding characteristics. The
antibody may also be engineered to aid the specific
delivery of a molecule or substance to a specific cell or
tissue site. The desired alterations may be made by known
techniques in molecular biology, e.g., recombinant
techniques, site-directed mutagenesis, etc.

Yet another example is "univalent antibodies",
which are aggregates comprised of a heavy-chain/light-
chain dimer bound to the Fc (i.e., stem) region of a
second heavy chain. This type of antibody escapes
antigenic modulation. See, e.g., Glennie et al., Nature
295: 712 (1982). Included also within the definition of
antibodies are "Fab" fragments of antibodies. The "Fab"
region refers to those portions of the heavy and light
chains which are roughly equivalent, or analogous, to the
sequences which comprise the branch portion of the heavy
and light chains, and which have been shown to exhibit
immunological binding to a specified antigen, but which
lack the effector Fc portion. "Fab" includes aggregates
of one heavy and one light chain (commonly known as
Fab'), as well as tetramers containing the 2H and 2L
chains (referred to as F(ab)2), which are capable of
selectively reacting with a designated antigen or antigen
family. Fab antibodies may be divided into subsets
analogous to those described above, i.e., "vertebrate
Fab", "hybrid Fab", "chimeric Fab", and "altered Fab".
Methods of producing Fab fragments of antibodies are
known within the art and include, for example,
proteolysis, and synthesis by recombinant techniques.

"Epitope" refers to an antibody binding site
usually defined by a polypeptide, but also by non-amino
acid haptens. An epitope could comprise 3 amino acids
in a spatial conformation which is unique to the epitope;
generally an epitope consists of at least 5 such amino
acids, and more usually, consists of at least 8-10 such
amino acids.

"Antigen-antibody complex" refers to the
complex formed by an antibody that is specifically bound
to an epitope on an antigen.

"Immunogenic polypeptide" refers to a
polypeptide that elicits a cellular and/or humoral immune
response in a mammal, whether alone or linked to a
carrier, in the presence or absence of an adjuvant.

"Polypeptide" refers to a polymer of amino
acids and does not refer to a specific length of the
molecule. Thus, peptides, oligopeptides, and proteins
are included within the definition of polypeptide. This
term also does not refer to or exclude post-expression
modifications of the polypeptide, for example,
glycosylations, acetylations, phosphorylations and the
like. Included within the definition are, for example,
polypeptides containing one or more analogs of an amino
acid (including, for example, unnatural amino acids,
etc.), polypeptides with substituted linkages, as well as
other modifications known in the art, both naturally
occurring and non-naturally occurring.

"Transformation", as used herein, refers to the
insertion of an exogenous polynucleotide into a host
cell, irrespective of the method used for the insertion,
for example, direct uptake, transduction, f-mating or
electroporation. The exogenous polynucleotide may be
maintained as a non-integrated vector, for example, a
plasmid, or alternatively, may be integrated into the
host genome.

A "transformed" host cell refers to both the
immediate cell that has undergone transformation and its
progeny that maintain the originally exogenous
polynucleotide.

"Treatment" as used herein refers to
prophylaxis and/or therapy.

"Individual" refers to vertebrates,
particularly members of the mammalian species, and
includes but is not limited to domestic animals, sports
animals, and primates, including humans.

"Sense strand" refers to the strand of a
double-stranded DNA molecule that is homologous to an mRNA
transcript thereof. The "anti-sense strand" contains a
sequence which is complementary to that of the "sense
strand".

"Antibody-containing body component" refers to
a component of an individual's body which is a source of
the antibodies of interest. Antibody-containing body
components are known in the art, and include but are not
limited to, whole blood and components thereof, plasma,
serum, spinal fluid, lymph fluid, the external sections
of the respiratory, intestinal, and genitourinary tracts,
tears, saliva, milk, white blood cells, and myelomas.

"Purified HCV isolate" refers to a preparation
of HCV particles which has been isolated from the
cellular constituents with which the virus is normally
associated, and from other types of viruses which may be
present in the infected tissue. The techniques for
isolating viruses are known to those of skill in the art
and include, for example, centrifugation and affinity
chromatography.

An HCV "particle" is an entire virion, as well
as particles which are intermediates in virion formation.
HCV particles generally have one or more HCV proteins
associated with the HCV nucleic acid.

"Probe" refers to a polynucleotide which forms
a hybrid structure with a sequence in a target
polynucleotide, due to complementarity of at least one
region in the probe with a region in the target.

"Biological sample" refers to a sample of
tissue or fluid isolated from an individual, including
but not limited to, for example, whole blood and
components thereof, plasma, serum, spinal fluid, lymph
fluid, the external sections of the skin, respiratory,
intestinal, and genitourinary tracts, tears, saliva,
milk, blood cells, tumors, organs, and also samples of in
vitro cell culture constituents (including but not
limited to conditioned medium resulting from the growth
of cells in cell culture medium, putatively virally
infected cells, recombinant cells, and cell components).

The invention pertains to the isolation and
characterization of newly discovered isolates of HCV, J1
and J7, their nucleotide sequences, their protein
sequences and resulting polynucleotides, polypeptides and
antibodies derived therefrom. Isolates J1 and J7 are
novel in their nucleotide and amino acid sequences, and
are believed to be characteristic of HCV isolates from
Japan and other Asian countries.

The nucleotide sequences derived from HCV/J1
and HCV/J7 are useful as probes to diagnose the presence
of virus in samples, and to isolate other naturally
occurring variants of the virus. These nucleotide
sequences also make available polypeptide sequences of
HCV antigens encoded within the J1 and J7 genomes and
permit the production of polypeptides which are useful
as standards or reagents in diagnostic tests and/or as
components of vaccines. Antibodies, both polyclonal and
monoclonal, directed against HCV epitopes contained
within these polypeptide sequences are also useful for
diagnostic tests, as therapeutic agents, for screening of
antiviral agents, and for the isolation of the NANBH
virus. In addition, by utilizing probes derived from the
sequences disclosed herein it is possible to isolate and
sequence other portions of the J1 and J7 genomes, thus
giving rise to additional probes and polypeptides which
are useful in the diagnosis and/or treatment, both
prophylactic and therapeutic, of NANBH.

The availability of the HCV/J1 and HCV/J7
nucleotide sequences enables the construction of
polynucleotide probes and polypeptides useful in
diagnosing NANBH due to HCV infection and in screening
blood donors as well as donated blood and blood products
for infection. For example, from the sequences it is
possible to synthesize DNA oligomers of about 8-10
nucleotides, or larger, which are useful as hybridization
probes to detect the presence of HCV RNA in, for example,
sera of subjects suspected of harboring the virus, or for
screening donated blood for the presence of the virus.
The HCV/J1 and HCV/J7 sequences also allow the design and
production of HCV-specific polypeptides which are useful
as diagnostic reagents for the presence of antibodies
raised during NANBH. Antibodies to purified polypeptides
derived from the HCV/J1 and HCV/J7 sequences may also be
used to detect viral antigens in infected individuals and
in blood.
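
By way of illustration only, the derivation of candidate
oligomer probes from a known sequence may be sketched as
follows. The sketch assumes that the cDNA is written as
the sense strand (the same sense as the target RNA), so
that each probe is the reverse complement of a window of
that strand. The function names, the window length, the
step size and the placeholder sequence are hypothetical
and are not taken from the HCV/J1 or HCV/J7 sequences.

```python
# Illustrative sketch only. The sequence used below is a made-up
# placeholder, NOT HCV/J1 or HCV/J7 sequence data.

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    dna = dna.upper()
    if set(dna) - set("ACGT"):
        raise ValueError("sequence must contain only A, C, G and T")
    return dna.translate(COMPLEMENT)[::-1]


def antisense_probes(sense_cdna: str, length: int = 10, step: int = 1):
    """Tile a sense-strand cDNA into antisense oligomers.

    Returns a list of (start_index, probe) pairs, where each probe is
    complementary to sense_cdna[start_index:start_index + length].
    """
    if length < 1 or step < 1:
        raise ValueError("length and step must be positive")
    probes = []
    for start in range(0, len(sense_cdna) - length + 1, step):
        window = sense_cdna[start:start + length]
        probes.append((start, reverse_complement(window)))
    return probes


if __name__ == "__main__":
    placeholder = "ATGGCCTTAGGCAACGT"  # hypothetical sense-strand fragment
    for start, probe in antisense_probes(placeholder, length=10, step=5):
        print(start, probe)
```

A probe selected in this way would still need to be
checked experimentally for specificity and hybridization
behavior; the sketch addresses only the sequence
bookkeeping.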

Knowledge of these HCV/J1 and HCV/J7 sequences
also enables the design and production of polypeptides
which may be used as vaccines against HCV and also for
the production of antibodies, which in turn may be used
for protection against the disease, and/or for therapy of
HCV infected individuals. Moreover, the disclosed HCV/J1
and HCV/J7 sequences enable further characterization of
the HCV genome. Polynucleotide probes derived from these
sequences, as well as from the HCV genome, may be used to
screen cDNA libraries for additional viral cDNA
sequences, which, in turn, may be used to obtain
additional overlapping sequences. See, e.g., EPO Pub.
No. 318,216.

The HCV/J1 and HCV/J7 polynucleotide sequences,
the polypeptides derived therefrom and the antibodies
directed against these polypeptides, are useful in the
isolation and identification of the BB-NANBV agent(s).
For example, antibodies directed against HCV epitopes
contained in polypeptides derived from the HCV/J1
sequences may be used in processes based upon affinity
chromatography to isolate the virus. Alternatively, the
antibodies may be used to identify viral particles
isolated by other techniques. The viral antigens and the
genomic material within the isolated viral particles may
then be further characterized.

The information obtained from further
sequencing of the HCV/J1 and HCV/J7 genomes, as well as
from further characterization of the HCV/J1 and HCV/J7
antigens and characterization of the genomes, enables the
design and synthesis of additional probes and
polypeptides and antibodies which may be used for
diagnosis, for prevention, and for therapy of HCV-induced
NANBH, and for screening for infected blood and
blood-related products.

The availability of HCV/J1 and HCV/J7 cDNA
sequences permits the construction of expression vectors
encoding antigenically active regions of the polypeptide
encoded in either strand. These antigenically active
regions may be derived from coat or envelope antigens or
from core antigens, or from antigens which are
non-structural including, for example, polynucleotide
binding proteins, polynucleotide polymerase(s), and other
viral proteins required for the replication and/or
assembly of the virus particle. Fragments encoding the
desired polypeptides are derived from the cDNA clones
using conventional restriction digestion or by synthetic
methods, and are ligated into vectors which may, for
example, contain portions of fusion sequences such as
beta-galactosidase or superoxide dismutase (SOD). Methods
and vectors which are useful for the production of
polypeptides which contain fusion sequences of SOD are
described in EPO Pub. No. 196,056. Vectors encoding
fusion polypeptides of SOD and HCV polypeptides are
described in EPO Pub. No. 318,216. Any desired portion
of the HCV cDNA containing an open reading frame, in
either sense strand, can be obtained as a recombinant
polypeptide, such as a mature or fusion protein.
Alternatively, a polypeptide encoded in the cDNA can be
provided by chemical synthesis.
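
As a non-limiting sketch, candidate open reading frames
in a cDNA fragment may be located by translating all
three frames of each strand and collecting stretches that
are free of stop codons. The sketch assumes the standard
genetic code and treats an open reading frame simply as a
stop-free run of codons (no start codon is required,
since a fragment may be expressed as a fusion). The
function names, the minimum length and the test sequence
are hypothetical.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an A/C/G/T sequence."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def translate(dna: str) -> str:
    """Translate complete codons of dna in frame 0; '*' marks a stop.

    Raises KeyError if the sequence contains symbols other than A/C/G/T.
    """
    dna = dna.upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def open_reading_frames(dna: str, min_codons: int = 5):
    """Return (strand, frame, peptide) for every stop-free run of at
    least min_codons codons, in all three frames of both strands."""
    results = []
    for strand, seq in (("+", dna.upper()), ("-", reverse_complement(dna))):
        for frame in range(3):
            for segment in translate(seq[frame:]).split("*"):
                if len(segment) >= min_codons:
                    results.append((strand, frame, segment))
    return results


if __name__ == "__main__":
    placeholder = "ATGGCTGCTAAATAA"  # hypothetical fragment, not HCV data
    for hit in open_reading_frames(placeholder, min_codons=4):
        print(hit)
```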

The DNA encoding the desired polypeptide,
whether in fused or mature form, and whether or not
containing a signal sequence to permit secretion, may be
ligated into expression vectors suitable for any
convenient host. Both eukaryotic and prokaryotic host
systems are presently used in forming recombinant
polypeptides, and a summary of some of the more common
control systems and host cells is given below. The
polypeptide produced in such host cells is then isolated
from lysed cells or from the culture medium and purified
to the extent needed for its intended use. Purification
may be by techniques known in the art, for example,
differential extraction, salt fractionation,
chromatography on ion exchange resins, affinity
chromatography, centrifugation, and the like. See, for
example, Methods in Enzymology for a variety of methods
for purifying proteins.

Such recombinant or synthetic HCV polypeptides
can be used as diagnostics, or those which give rise to
neutralizing antibodies may be formulated into vaccines.
Antibodies raised against these polypeptides can also be
used as diagnostics, or for passive immunotherapy. In
addition, antibodies to these polypeptides are useful for
isolating and identifying HCV particles.

The HCV antigens may also be isolated from HCV
virions. The virions may be grown in HCV infected cells
in tissue culture, or in an infected host.

While the polypeptides of the present invention
may comprise a substantially complete viral domain, in
many applications all that is required is that the
polypeptide comprise an antigenic or immunogenic region
of the virus. An antigenic region of a polypeptide is
generally relatively small, typically 8 to 10 amino acids
or less in length. Fragments of as few as 5 amino acids
may characterize an antigenic region. These segments may
correspond to regions of HCV/J1 or HCV/J7 epitopes.
Accordingly, using the cDNAs of HCV/J1 and HCV/J7 as a
basis, DNAs encoding short segments of HCV/J1 and HCV/J7
polypeptides can be expressed recombinantly either as
fusion proteins, or as isolated polypeptides. In
addition, short amino acid sequences can be conveniently
obtained by chemical synthesis.

In instances wherein the synthesized
polypeptide is correctly configured so as to provide the
correct epitope, but is too small to be immunogenic, the
polypeptide may be linked to a suitable carrier. A
number of techniques for obtaining such linkage are known
in the art, including the formation of disulfide linkages
using N-succinimidyl-3-(2-pyridyl-thio)propionate (SPDP)
and succinimidyl 4-(N-maleimido-methyl)cyclohexane-
1-carboxylate (SMCC) obtained from Pierce Company,
Rockford, Illinois. (If the peptide lacks a sulfhydryl
group, this can be provided by addition of a cysteine
residue.) These reagents create a disulfide linkage
between themselves and peptide cysteine residues on one
protein and an amide linkage through the epsilon-amino on
a lysine, or other free amino group in the other. A
variety of such disulfide/amide-forming agents are known.
See, for example, Immun. Rev. (1982) 62:185. Other
bifunctional coupling agents form a thioether rather than
a disulfide linkage. Many of these thioether-forming
agents are commercially available and include reactive
esters of 6-maleimidocaproic acid, 2-bromoacetic acid,
2-iodoacetic acid, 4-(N-maleimido-methyl)cyclohexane-
1-carboxylic acid, and the like. The carboxyl groups can
be activated by combining them with succinimide or
1-hydroxyl-2-nitro-4-sulfonic acid, sodium salt.
An additional method of coupling antigens employs the
rotavirus/"binding peptide" system described in EPO Pub.
No. 259,149, the disclosure of which is incorporated
herein by reference. The foregoing list is not meant to
be exhaustive, and modifications of the named compounds
can clearly be used.

Any carrier may be used which does not itself
induce the production of antibodies harmful to the host.
Suitable carriers are typically large, slowly metabolized
macromolecules such as proteins; polysaccharides, such as
latex functionalized sepharose, agarose, cellulose,
cellulose beads and the like; polymeric amino acids, such
as polyglutamic acid, polylysine, and the like; amino acid
copolymers; and inactive virus particles. Especially
useful protein substrates are serum albumins, keyhole
limpet hemocyanin, immunoglobulin molecules,
thyroglobulin, ovalbumin, tetanus toxoid, and other
proteins well known to those skilled in the art.

In addition to full-length viral proteins,
polypeptides comprising truncated HCV amino acid
sequences encoding at least one viral epitope are useful
immunological reagents. For example, polypeptides
comprising such truncated sequences can be used as
reagents in an immunoassay. These polypeptides also are
candidate subunit antigens in compositions for antiserum
production or vaccines. While these truncated sequences
can be produced by various known treatments of native
viral protein, it is generally preferred to make
synthetic or recombinant polypeptides comprising an HCV
sequence. Polypeptides comprising these truncated HCV
sequences can be made up entirely of HCV sequences (one
or more epitopes, either contiguous or noncontiguous), or
HCV sequences and heterologous sequences in a fusion
protein. Useful heterologous sequences include sequences
that provide for secretion from a recombinant host,
enhance the immunological reactivity of the HCV
epitopes, or facilitate the coupling of the polypeptide
to an immunoassay support or a vaccine carrier. See,
e.g., EPO Pub. No. 116,201; EPO Pub. No. 259,149; U.S.
Pat. No. 4,629,783, the disclosures of which are
incorporated herein by reference.

The size of polypeptides comprising the
truncated HCV sequences can vary widely, the minimum size
being a sequence of sufficient size to provide an HCV
epitope, while the maximum size is not critical. In some
applications, the maximum size usually is not
substantially greater than that required to provide the
desired HCV epitopes and function(s) of the heterologous
sequence, if any. Typically, the truncated HCV amino
acid sequence will range from about 5 to about 100 amino
acids in length. More typically, however, the HCV
sequence will be a maximum of about 50 amino acids in
length, preferably a maximum of about 30 amino acids. It
is usually desirable to select HCV sequences of at least
about 10, 12 or 15 amino acids, up to a maximum of about
20 or 25 amino acids.

Truncated HCV amino acid sequences comprising
epitopes can be identified in a number of ways. For
example, the entire viral protein sequence can be
screened by preparing a series of short peptides that
together span the entire protein sequence. By starting
with, for example, 100-mer polypeptides, it would be
routine to test each polypeptide for the presence of
epitope(s) showing a desired reactivity, and then to test
progressively smaller and overlapping fragments from an
identified 100-mer to map the epitope of interest.
Screening such peptides in an immunoassay is within the
skill of the art. It is also known to carry out a
computer analysis of a protein sequence to identify
potential epitopes, and then prepare oligopeptides
comprising the identified regions for screening. It is
appreciated by those of skill in the art that such
computer analysis of antigenicity does not always
identify an epitope that actually exists, and can also
incorrectly identify a region of the protein as
containing an epitope.
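
The bookkeeping for such a screen may be sketched, purely
as an illustration, as follows. The sketch tiles a protein
sequence into overlapping peptides of a chosen length; the
same routine can then be reapplied with a smaller length
to a fragment that proved reactive. The function name, the
lengths, the overlaps and the placeholder sequence are
hypothetical, and the immunoassay itself is outside the
scope of the sketch.

```python
def overlapping_peptides(protein: str, length: int, overlap: int):
    """Tile protein into peptides of the given length that overlap
    their neighbours by `overlap` residues.

    Returns a list of (start_index, peptide) pairs that together span
    the whole sequence; the final peptide may be shorter than `length`.
    """
    if length < 1 or not 0 <= overlap < length:
        raise ValueError("require length >= 1 and 0 <= overlap < length")
    if not protein:
        return []
    step = length - overlap
    peptides = []
    start = 0
    while True:
        peptides.append((start, protein[start:start + length]))
        if start + length >= len(protein):
            break
        start += step
    return peptides


if __name__ == "__main__":
    # Hypothetical placeholder sequence, not an HCV polypeptide.
    placeholder = "ACDEFGHIKLMNPQRSTVWY" * 10

    # First pass: coarse fragments (e.g. 100-mers) spanning the sequence.
    first_pass = overlapping_peptides(placeholder, length=100, overlap=20)

    # Second pass: finer fragments from one fragment assumed, for the
    # sake of the example, to have scored as reactive in an immunoassay.
    offset, reactive = first_pass[0]
    second_pass = overlapping_peptides(reactive, length=20, overlap=10)
    print(len(first_pass), len(second_pass))
```

The choice of overlap is a design decision: it should be
at least as long as the smallest epitope of interest less
one residue, so that no linear epitope is split across
every peptide that contains part of it.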

The observed relationship of the putative
polyproteins of HCV and the Flaviviruses allows a
prediction of the putative domains of the HCV
"non-structural" (NS) proteins. The locations of the
individual NS proteins in the putative Flavivirus
precursor polyprotein are fairly well-known. Moreover,
these also coincide with observed gross fluctuations in
the hydrophobicity profile of the polyprotein. It is
established that NS5 of Flaviviruses encodes the virion
polymerase, and that NS1 corresponds with a complement
fixation antigen which has been shown to be an effective
vaccine in animals. Recently, it has been shown that a
flaviviral protease function resides in NS3. Due to the
observed similarities between HCV and the Flaviviruses,
deductions concerning the approximate locations of the
corresponding protein domains and functions in the HCV
polyprotein are possible. Figure 11 is a schematic of
putative domains of the HCV polyprotein. The expression
of polypeptides containing these domains in a variety of
recombinant host cells, including, for example, bacteria,
yeast, insect, and vertebrate cells, should give rise to
important immunological reagents which can be used for
diagnosis, detection, and vaccines.
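
For illustration, a hydrophobicity profile of the kind
referred to above may be computed as a sliding-window
average of per-residue hydropathy values. The sketch
below assumes the Kyte-Doolittle scale and a window of 7
residues; neither the scale nor the window is specified
in this disclosure, and the function name is
hypothetical.

```python
# Kyte-Doolittle hydropathy values (assumed scale; the text above does
# not state which scale was used).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(protein: str, window: int = 7):
    """Return the mean hydropathy of each window along the sequence.

    Element i of the result is the average over residues
    i .. i + window - 1. Raises KeyError for non-standard residues.
    """
    protein = protein.upper()
    if window < 1 or window > len(protein):
        raise ValueError("window must be between 1 and len(protein)")
    scores = [KYTE_DOOLITTLE[aa] for aa in protein]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(scores) - window + 1)
    ]


if __name__ == "__main__":
    # Hypothetical placeholder: a basic stretch followed by a
    # hydrophobic stretch, not an HCV sequence.
    placeholder = "MKRKRNRRSTLLVVAALLIIGVFA"
    for i, value in enumerate(hydropathy_profile(placeholder, window=7)):
        print(i, round(value, 2))
```

Gross fluctuations in such a profile are only a rough
guide to domain boundaries and would be read together
with the sequence comparison described above.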

Although the non-structural protein region of
the putative polyproteins of the HCV isolate described
herein and of Flaviviruses appears to be generally
similar, there is less similarity between the putative
structural regions which are towards the N-terminus. In
this region, there is a greater divergence in sequence,
and in addition, the hydrophobic profile of the two
regions shows less similarity. This "divergence" begins
in the N-terminal region of the putative NS1 domain in
HCV, and extends to the presumed N-terminus.
Nevertheless, it is still possible to predict the
approximate locations of the putative nucleocapsid
(N-terminal basic domain) and E (generally hydrophobic)
domains within the HCV polyprotein. From these
predictions it may be possible to identify approximate
regions of the HCV polyprotein that could correspond with
useful immunological reagents. For example, the E and
NS1 proteins of Flaviviruses are known to have efficacy
as protective vaccines. These regions, as well as some
which are shown to be antigenic in the HCV1, for example
those within putative NS3, C, and NS5, etc., should also
provide diagnostic reagents.

The immunogenicity of the HCV sequences may
also be enhanced by preparing the sequences fused to or
assembled with particle-forming proteins such as, for
example, hepatitis B surface antigen or rotavirus VP6
antigen. Constructs wherein the HCV epitope is linked
directly to the particle-forming protein coding sequences
produce hybrids which are immunogenic with respect to the
HCV epitope. In addition, all of the vectors prepared
include epitopes specific to HBV, having various degrees
of immunogenicity, such as, for example, the pre-S
peptide. Thus, particles constructed from particle-
forming protein which include HCV sequences are
immunogenic with respect to HCV and particle-forming
protein. See, e.g., U.S. Pat. No. 4,722,840; EPO Pub.
No. 175,261; EPO Pub. No. 259,149; Michelle et al. (1984)
Int. Symposium on Viral Hepatitis.

Vaccines may be prepared from one or more
immunogenic polypeptides derived from HCV/J1 or HCV/J7.
The observed homology between HCV and Flaviviruses
provides information concerning the polypeptides which
are likely to be most effective as vaccines, as well as
the regions of the genome in which they are encoded. The
general structure of the Flavivirus genome is discussed
in Rice et al. (1986) in THE VIRUSES: THE TOGAVIRIDAE AND
FLAVIVIRIDAE (Series eds. Fraenkel-Conrat and Wagner, Vol.
eds. Schlesinger and Schlesinger, Plenum Press). The
flavivirus genomic RNA is believed to be the only
virus-specific mRNA species, and it is translated into
the three viral structural proteins, i.e., C, M, and E,
as well as two large nonstructural proteins, NV4 and NV5,
and a complex set of smaller nonstructural proteins. It
is known that major neutralizing epitopes for
Flaviviruses reside in the E (envelope) protein. Roehrig
(1986) in THE VIRUSES: THE TOGAVIRIDAE AND FLAVIVIRIDAE
(Series eds. Fraenkel-Conrat and Wagner, Vol. eds.
Schlesinger and Schlesinger, Plenum Press). The
corresponding HCV E gene and polypeptide encoding region
may be predicted, based upon the homology to
Flaviviruses. Thus, vaccines may be comprised of
recombinant polypeptides containing epitopes of HCV E.
These polypeptides may be expressed in bacteria, yeast,
or mammalian cells, or alternatively may be isolated from
viral preparations. It is also anticipated that the
other structural proteins may also contain epitopes which
give rise to protective anti-HCV antibodies. Thus,
polypeptides containing the epitopes of E, C, and M may
also be used in HCV vaccines, whether singly or in
combination.

In addition to the above, it has been shown
that immunization with NS1 (nonstructural protein 1)
results in protection against yellow fever. Schlesinger
et al. (1986) J. Virol. 60:1153. This is true even though
the immunization does not give rise to neutralizing
antibodies. Thus, particularly since this protein
appears to be highly conserved among Flaviviruses, it is
likely that HCV NS1 will also be protective against HCV
infection. Moreover, it also shows that nonstructural
proteins may provide protection against viral
pathogenicity, even if they do not cause the production
of neutralizing antibodies.

In view of the above, multivalent vaccines
against HCV may be comprised of one or more epitopes from
one or more structural proteins, and/or one or more
epitopes from one or more nonstructural proteins. These
vaccines may be comprised of, for example, recombinant
HCV polypeptides and/or polypeptides isolated from the
virions. In particular, vaccines are contemplated
comprising one or more of the following HCV proteins, or
subunit antigens derived therefrom: E, NS1, C, NS2, NS3,
NS4 and NS5. Particularly preferred are vaccines
comprising E and/or NS1, or subunits thereof. In
addition, it may be possible to use inactivated HCV in
vaccines; inactivation may be by the preparation of viral
lysates, or by other means known in the art to cause
inactivation of Flaviviruses, for example, treatment with
organic solvents or detergents, or treatment with
formalin. Moreover, vaccines may also be prepared from
attenuated HCV strains or from hybrid viruses such as
vaccinia vectors known in the art [Brown et al. Nature
319: 549-550 (1986)].

The preparation of vaccines which contain
immunogenic polypeptide(s) as active ingredients is known
to one skilled in the art. Typically, such vaccines are
prepared as injectables, either as liquid solutions or
suspensions; solid forms suitable for solution in, or
suspension in, liquid prior to injection may also be
prepared. The preparation may also be emulsified, or the
protein encapsulated in liposomes. The active
immunogenic ingredients are often mixed with excipients
which are pharmaceutically acceptable and compatible with
the active ingredient. Suitable excipients are, for
example, water, saline, dextrose, glycerol, ethanol, or
the like and combinations thereof. In addition, if
desired, the vaccine may contain minor amounts of
auxiliary substances such as wetting or emulsifying
agents, pH buffering agents, and/or adjuvants which
enhance the effectiveness of the vaccine. Examples of
adjuvants which may be effective include but are not
limited to: aluminum hydroxide, N-acetyl-muramyl-L-
threonyl-D-isoglutamine (thr-MDP), N-acetyl-nor-muramyl-
L-alanyl-D-isoglutamine (CGP 11637, referred to as
nor-MDP), N-acetylmuramyl-L-alanyl-D-isoglutaminyl-
L-alanine-2-(1'-2'-dipalmitoyl-sn-glycerol-3-
hydroxyphosphoryloxy)-ethylamine (CGP 19835A, referred to
as MTP-PE), and RIBI, which contains three components
extracted from bacteria, monophosphoryl lipid A,
trehalose dimycolate and cell wall skeleton (MPL+TDM+CWS)
in a 2% squalene/Tween 80 emulsion. The effectiveness of
an adjuvant may be determined by measuring the amount of
antibodies directed against an immunogenic polypeptide
containing an HCV antigenic sequence resulting from
administration of this polypeptide in vaccines which are
also comprised of the various adjuvants.

The vaccines are conventionally administered
parenterally, by injection, usually either
subcutaneously or intramuscularly. Additional formulations
which are suitable for other modes of administration
include suppositories and, in some cases, oral
formulations. For suppositories, traditional binders and
carriers may include, for example, polyalkylene glycols or
triglycerides; such suppositories may be formed from
mixtures containing the active ingredient in the range of
0.5% to 10%, preferably 1%-2%. Oral formulations include
such normally employed excipients as, for example,
pharmaceutical grades of mannitol, lactose, starch,
magnesium stearate, sodium saccharine, cellulose,
magnesium carbonate, and the like. These compositions
take the form of solutions, suspensions, tablets, pills,
capsules, sustained release formulations or powders and
contain 10%-95% of active ingredient, preferably 25%-70%.

The proteins may be formulated into the vaccine
as neutral or salt forms. Pharmaceutically acceptable
salts include the acid addition salts (formed with free
amino groups of the peptide), which are formed with
inorganic acids such as, for example, hydrochloric or
phosphoric acids, or such organic acids as acetic,
oxalic, tartaric, maleic, and the like. Salts formed
with the free carboxyl groups may also be derived from
inorganic bases such as, for example, sodium, potassium,
ammonium, calcium, or ferric hydroxides, and such organic
bases as isopropylamine, trimethylamine, 2-ethylamino
ethanol, histidine, procaine, and the like.

The vaccines are administered in a manner
compatible with the dosage formulation, and in such
amount as will be prophylactically and/or therapeutically
effective. The quantity to be administered, which is
generally in the range of 5 micrograms to 250 micrograms
of antigen per dose, depends on the subject to be
treated, the capacity of the subject's immune system to
synthesize antibodies, and the degree of protection
desired. Precise amounts of active ingredient required
to be administered may depend on the judgment of the
practitioner and may be peculiar to each subject.

The vaccine may be given in a single dose
schedule, or preferably in a multiple dose schedule. A
multiple dose schedule is one in which a primary course
of vaccination may be with 1-10 separate doses, followed
by other doses given at subsequent time intervals
required to maintain and/or reinforce the immune
response, for example, at 1-4 months for a second dose,
and if needed, a subsequent dose(s) after several months.
The dosage regimen will also, at least in part, be
determined by the need of the individual and be dependent
upon the judgment of the practitioner.

In addition, the vaccine containing the im-
munogenic HCV antigen(s) may be administered in conjunc-
tion with other immunoregulatory agents, for example,
immune globulins.

The immunogenic polypeptides prepared as
described above are used to produce antibodies, both
polyclonal and monoclonal. If polyclonal antibodies are
desired, a selected mammal (e.g., mouse, rabbit, goat,
horse, etc.) is immunized with an immunogenic polypeptide
bearing an HCV epitope(s). Serum from the immunized
animal is collected and treated according to known
procedures. If serum containing polyclonal antibodies to
an HCV epitope contains antibodies to other antigens, the
polyclonal antibodies can be purified by immunoaffinity
chromatography. Techniques for producing and processing
polyclonal antisera are known in the art, see for
example, Mayer and Walker, eds. (1987) IMMUNOCHEMICAL
METHODS IN CELL AND MOLECULAR BIOLOGY (Academic Press,
London).

Monoclonal antibodies directed against HCV
epitopes can also be readily produced by one skilled in
the art. The general methodology for making monoclonal
antibodies by hybridomas is well known. Immortal
antibody-producing cell lines can be created by cell fu-
sion and also by other techniques such as direct trans-
formation of B lymphocytes with oncogenic DNA or
transfection with Epstein-Barr virus. See, e.g., M.
Schreier et al. (1980) HYBRIDOMA TECHNIQUES; Hammerling
et al. (1981), MONOCLONAL ANTIBODIES AND T-CELL
HYBRIDOMAS; Kennett et al. (1980) MONOCLONAL ANTIBODIES;
see also, U.S. Patent Nos. 4,341,761; 4,399,121;
4,427,783; 4,444,887; 4,466,917; 4,472,500; 4,491,632;
and 4,493,890. Panels of monoclonal antibodies produced
against HCV epitopes can be screened for various
properties; i.e., for isotype, epitope affinity, etc.

Antibodies, both monoclonal and polyclonal,
which are directed against HCV epitopes are particularly
useful in diagnosis, and those which are neutralizing are
useful in passive immunotherapy. Monoclonal antibodies,
in particular, may be used to raise anti-idiotype
antibodies.

Anti-idiotype antibodies are immunoglobulins
which carry an "internal image" of the antigen of the
infectious agent against which protection is desired.
Techniques for raising anti-idiotype antibodies are known
in the art. See, e.g., Grzych (1985), Nature 316:74;
MacNamara et al. (1984), Science 226:1325; Uytdehaag et
al. (1985), J. Immunol. 134:1225. These anti-idiotype
antibodies may also be useful for treatment and/or
diagnosis of NANBH, as well as for an elucidation of the
immunogenic regions of HCV antigens.

Using the HCV/J1 or HCV/J7 polynucleotide
sequences as a basis, oligomers of approximately 8
nucleotides or more can be prepared, either by excision
or synthetically, which hybridize with the HCV genome and
are useful in identification of the viral agent(s),
further characterization of the viral genome(s), as well
as in detection of the virus(es) in diseased individuals.
The probes for HCV polynucleotides (natural or derived)
are of a length which allows the detection of unique viral
sequences by hybridization. While 6-8 nucleotides may be
a workable length, sequences of about 10-12 nucleotides
are preferred, and about 20 nucleotides appears optimal.
These probes can be prepared using routine methods,
including automated oligonucleotide synthetic methods.
Among useful probes, for example, are the clones
disclosed herein, as well as the various oligomers useful
in probing cDNA libraries, set forth below. A complement
to any unique portion of the HCV genome will be satisfac-
tory. For use as probes, complete complementarity is
desirable, though it may be unnecessary as the length of
the fragment is increased.
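
The relationship between a probe and its target site described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names and the `max_mismatches` parameter are hypothetical, the sequences used with it would be made up, and real hybridization depends on conditions that simple string matching does not model.

```python
# Illustrative sketch only; names and parameters are hypothetical.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def find_probe_sites(target: str, probe: str, max_mismatches: int = 0) -> list[int]:
    """Return 0-based positions on `target` where `probe` could anneal.

    The probe pairs with the stretch of the target that equals the probe's
    reverse complement. `max_mismatches` loosely mimics the idea that longer
    probes may tolerate incomplete complementarity.
    """
    target = target.upper()
    site = reverse_complement(probe)
    hits = []
    for i in range(len(target) - len(site) + 1):
        window = target[i:i + len(site)]
        mismatches = sum(a != b for a, b in zip(window, site))
        if mismatches <= max_mismatches:
            hits.append(i)
    return hits
```

With `max_mismatches=0` the search demands complete complementarity; raising it for a longer probe reflects the remark that complete complementarity becomes less necessary as fragment length increases.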

For use of such probes as diagnostics, the bio-
logical sample to be analyzed, such as blood or serum,
may be treated, if desired, to extract the nucleic acids
contained therein. The resulting nucleic acid from the
sample may be subjected to gel electrophoresis or other
size separation techniques; alternatively, the nucleic
acid sample may be dot blotted without size separation.
The probes are then labeled. Suitable labels, and
methods for labeling probes are known in the art, and
include, for example, radioactive labels incorporated by
nick translation or kinasing, biotin, fluorescent probes,
and chemiluminescent probes. The nucleic acids extracted
from the sample are then treated with the labeled probe
under hybridization conditions of suitable stringencies.
Usually high stringency conditions are desirable in order
to prevent false positives. The stringency of
hybridization is determined by a number of factors dur-
ing hybridization and during the washing procedure,
including temperature, ionic strength, length of time,
and concentration of formamide. These factors are
outlined in, for example, Maniatis, T. (1982) MOLECULAR
CLONING: A LABORATORY MANUAL (Cold Spring Harbor Press,
Cold Spring Harbor, N.Y.).
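
To make the interplay of the stringency factors concrete, the sketch below uses a widely quoted empirical approximation for the melting temperature of long DNA duplexes. The formula and its coefficients are an assumption brought in for illustration and are not taken from this specification; the function name and default values are hypothetical, and the approximation is not suitable for short oligomers.

```python
import math


def estimate_tm(seq: str, na_molar: float = 0.5, formamide_pct: float = 0.0) -> float:
    """Approximate melting temperature (deg C) of a long DNA duplex.

    Assumed empirical relation (not from the source document):
        Tm = 81.5 + 16.6*log10[Na+] + 0.41*(%GC) - 0.61*(%formamide) - 500/L
    where L is the duplex length in base pairs.
    """
    seq = seq.upper()
    length = len(seq)
    gc_pct = 100.0 * sum(base in "GC" for base in seq) / length
    return (81.5
            + 16.6 * math.log10(na_molar)
            + 0.41 * gc_pct
            - 0.61 * formamide_pct
            - 500.0 / length)
```

Under this approximation, adding formamide or lowering the salt concentration lowers the estimated melting temperature, so a given hybridization or wash temperature becomes more stringent.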

Generally, it is expected that the HCV genome
sequences will be present in serum of infected
individuals at relatively low levels, i.e., at
approximately 10^2-10^3 chimp infectious doses (CID) per ml.
This level may require that amplification techniques be
used in hybridization assays. Such techniques are known
in the art. For example, the Enzo Biochemical
Corporation "Bio-Bridge" system uses terminal
deoxynucleotide transferase to add unmodified
3'-poly-dT-tails to a DNA probe. The poly dT-tailed
probe is hybridized to the target nucleotide sequence,
and then to a biotin-modified poly-A. PCT App. No.
84/03520 and EPO Pub. No. 124,221 describe a DNA
hybridization assay in which: (1) analyte is annealed to
a single-stranded DNA probe that is complementary to an
enzyme-labeled oligonucleotide; and (2) the resulting
tailed duplex is hybridized to an enzyme-labeled
oligonucleotide. EPO Pub. No. 204,510 describes a DNA
hybridization assay in which analyte DNA is contacted
with a probe that has a tail, such as a poly-dT tail, an
amplifier strand that has a sequence that hybridizes to
the tail of the probe, such as a poly-A sequence, and
which is capable of binding a plurality of labeled
strands.

A particularly desirable technique may first
involve amplification of the target HCV sequences in sera
approximately 10,000-fold, i.e., to approximately 10^6
sequences/ml. This may be accomplished, for example, by
the polymerase chain reaction (PCR) technique which is
described by Saiki et al. (1986) Nature 324:163, Mullis,
U.S. Patent No. 4,683,195, and Mullis et al. U.S. Patent
No. 4,683,202. The amplified sequence(s) may then be
detected using a hybridization assay which is described
in co-pending European Publication No. 317,077 and
Japanese application No. 63-260347, which are assigned to
the herein assignee, and are hereby incorporated herein
by reference. These hybridization assays, which should
detect sequences at the level of 10^6/ml, utilize nucleic
acid multimers which bind to single-stranded analyte
nucleic acid, and which also bind to a multiplicity of
single-stranded labeled oligonucleotides. A suitable
solution phase sandwich assay which may be used with
labeled polynucleotide probes, and the methods for the
preparation of probes, are described in EPO Pub. No.
225,807 which is hereby incorporated herein by refer-
ence.
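
As a rough arithmetic check on the 10,000-fold amplification mentioned above, the sketch below counts how many PCR cycles would be needed if each cycle multiplied the target by a fixed factor. The per-cycle `efficiency` parameter and the function name are assumptions made for illustration; real reactions rarely behave this ideally.

```python
def cycles_for_fold(fold: float, efficiency: float = 1.0) -> int:
    """Smallest number of cycles giving at least `fold` amplification.

    Assumes each cycle multiplies the target by (1 + efficiency), so
    efficiency=1.0 is ideal doubling. Hypothetical helper for illustration.
    """
    if efficiency <= 0:
        raise ValueError("efficiency must be positive")
    amount, cycles = 1.0, 0
    while amount < fold:
        amount *= 1.0 + efficiency
        cycles += 1
    return cycles
```

With ideal doubling, `cycles_for_fold(10_000)` gives 14 cycles (2^13 = 8,192 falls short, 2^14 = 16,384 suffices); at an assumed 80% per-cycle efficiency the same target takes 16 cycles.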

The probes can be packaged into diagnostic
kits. Diagnostic kits include the probe DNA, which may
be labeled; alternatively, the probe DNA may be unlabeled
and the ingredients for labeling may be included in the
kit in separate containers. The kit may also contain
other suitably packaged reagents and materials needed for
the particular hybridization protocol, for example,
standards, wash buffers, as well as instructions for
conducting the test.

Both the HCV/J1 and HCV/J7 polypeptides which
react immunologically with serum containing HCV
antibodies and the antibodies raised against the HCV
specific epitopes in these polypeptides are useful in
immunoassays to detect the presence of HCV antibodies, or the
presence of the virus and/or viral antigens, in bio-
logical samples. Design of the immunoassays is subject
to a great deal of variation, and a variety of these are
known in the art. An immunoassay for anti-HCV antibody
may utilize one viral epitope or several viral epitopes.
When multiple epitopes are used, the epitopes may be
derived from the same or different viral polypeptides,
and may be in separate recombinant or natural
polypeptides, or together in the same recombinant
polypeptides.

An immunoassay for viral antigen may use, for
example, a monoclonal antibody directed towards a viral
epitope, a combination of monoclonal antibodies directed
towards epitopes of one viral polypeptide, monoclonal
antibodies directed towards epitopes of different viral
polypeptides, polyclonal antibodies directed towards the
same viral antigen, polyclonal antibodies directed
towards different viral antigens or a combination of
monoclonal and polyclonal antibodies.

Immunoassay protocols may be based, for 
example, upon competition, or direct reaction, or 
sandwich type assays. Protocols may also, for example, 
use solid supports, or may be by immunoprecipitation. 
Most assays involve the use of labeled antibody or 
polypeptide. The labels may be, for example, 
fluorescent, chemiluminescent, radioactive, or dye 
molecules. Assays which amplify the signals from the
probe are also known. Examples are assays
which utilize biotin and avidin, and enzyme-labeled and
mediated immunoassays, such as ELISA assays.

Typically, an immunoassay for anti-HCV antibody
will involve selecting and preparing the test sample,
such as a biological sample, and then incubating it with
an antigenic (i.e., epitope-containing) HCV polypeptide
under conditions that allow antigen-antibody complexes to
form. Such conditions are well known in the art. In a
heterogeneous format, the polypeptide is bound to a solid
support to facilitate separation of the sample from the
polypeptide after incubation. Examples of solid supports
that can be used are nitrocellulose (in membrane or
microtiter well form), polyvinylchloride (in sheets or
microtiter wells), polystyrene latex (in beads or
microtiter plates), polyvinylidine fluoride (known as
Immobulon), diazotized paper, nylon membranes, activated
beads, and Protein A beads. Most preferably, the
Dynatech Immulon™ 1 microtiter plate or the 0.25-inch
polystyrene beads, which are Spec finished by Precision
Plastic Ball, are used in the heterogeneous format. The
solid support is typically washed after separating it
from the test sample. In a homogeneous format, the test
sample is incubated with antigen in solution, under
conditions that will precipitate any antigen-antibody
complexes that are formed, as is known in the art. The
precipitated complexes are then separated from the test
sample, for example, by centrifugation. The complexes
formed comprising anti-HCV antibody are then detected by
any of a number of techniques. Depending on the format,
the complexes can be detected with labeled anti-
xenogeneic Ig or, if a competitive format is used, by
measuring the amount of bound, labeled competing
antibody.

In immunoassays where HCV polypeptides are the
analyte, the test sample, typically a biological sample,
is incubated with anti-HCV antibodies, again under
conditions that allow the formation of antigen-antibody
complexes. Various formats can be employed, such as a
"sandwich" assay where antibody bound to a solid support
is incubated with the test sample; washed; incubated with
a second, labeled antibody to the analyte; and the
support is washed again. Analyte is detected by
determining if the second antibody is bound to the
support. In a competitive format, which can be either
heterogeneous or homogeneous, a test sample is usually
incubated with an antibody and a labeled, competing
antigen either sequentially or simultaneously. These and
other formats are well known in the art.

The Flavivirus model for HCV allows predictions
regarding the likely location of diagnostic epitopes for
the virion structural proteins. The C, pre-M, M, and E
domains are all likely to contain epitopes of significant
potential for detecting viral antigens, and particularly
for diagnosis. Similarly, domains of the nonstructural
proteins are expected to contain important diagnostic
epitopes (e.g., NS5 encoding a putative polymerase; and
NS1 encoding a putative complement-binding antigen). Re-
combinant polypeptides, or viral polypeptides, which
include epitopes from these specific domains may be
useful for the detection of viral antibodies in
infectious blood donors and infected patients. In
addition, antibodies directed against the E and/or M
proteins can be used in immunoassays for the detection of
viral antigens in patients with HCV-caused NANBH, and in
infectious blood donors. Moreover, these antibodies may
be extremely useful in detecting acute-phase donors and
patients.

Antigenic regions of the putative polyprotein
can be mapped and identified by screening the
antigenicity of bacterial expression products of HCV
cDNAs which encode portions of the polyprotein. Other
antigenic regions of HCV may be detected by expressing
the portions of the HCV cDNAs in other expression
systems, including yeast systems and cellular systems
derived from insects and vertebrates. In addition,
studies giving rise to an antigenicity index and
hydrophobicity/hydrophilicity profile provide
information concerning the probability of a region's
antigenicity. Efficient detection systems may include
the use of panels of epitopes. The epitopes in the panel
may be constructed into one or multiple polypeptides.
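
A hydrophobicity/hydrophilicity profile of the kind referred to above is commonly computed as a sliding-window average over a per-residue scale. The sketch below assumes the Kyte-Doolittle hydropathy scale and a seven-residue window; the specification does not name a scale or window, so both choices, and the function name, are illustrative assumptions.

```python
# Kyte-Doolittle hydropathy values (assumed scale; not named in the source).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(protein: str, window: int = 7) -> list[float]:
    """Mean hydropathy over each window of consecutive residues.

    Strongly negative stretches are hydrophilic and therefore more likely
    to be surface-exposed, which is one heuristic for candidate epitopes.
    """
    scores = [KYTE_DOOLITTLE[residue] for residue in protein.upper()]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(scores) - window + 1)
    ]
```

Regions where the profile is most negative would be the first candidates to test for antigenicity; the profile only suggests a probability and does not replace screening of expression products.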

Kits suitable for immunodiagnosis and contain-
ing the appropriate labeled reagents are constructed by
packaging the appropriate materials, including the
polypeptides of the invention containing HCV epitopes or
antibodies directed against HCV epitopes in suitable
containers, along with the remaining reagents and materi-
als required for the conduct of the assay (e.g., wash
buffers, detection means like labeled anti-human Ig,
labeled anti-HCV, or labeled HCV antigen), as well as a
suitable set of assay instructions.

The HCV/J1 and HCV/J7 nucleotide sequence
information described herein may be used to gain further
information on the sequence of the HCV genomes, and for
identification and isolation of additional HCV isolates
related to J1 or J7. This information, in turn, can lead
to additional polynucleotide probes, polypeptides derived
from the HCV genome, and antibodies directed against HCV
epitopes which would be useful for the diagnosis and/or
treatment of HCV-caused NANBH.

The HCV/J1 and HCV/J7 nucleotide sequence
information herein is useful for the design of probes for
the isolation of additional sequences which are derived
from as yet undefined regions of the HCV genomes from
which the J1 and J7 sequences are derived. For example,
labeled probes containing a sequence of approximately 8
or more nucleotides, and preferably 20 or more
nucleotides, which are derived from regions close to the
5'-termini or 3'-termini of the family of HCV cDNA
sequences disclosed in the examples may be used to
isolate overlapping cDNA sequences from HCV cDNA
libraries. These sequences which overlap the cDNAs in
the above-mentioned clones, but which also contain
sequences derived from regions of the genome from which
the cDNA in the above-mentioned clones are not derived,
may then be used to synthesize probes for identification
of other overlapping fragments which do not necessarily
overlap the cDNAs described below. Methods for
constructing cDNA libraries are known in the art. See,
e.g., EPO Pub. No. 318,216. It is particularly preferred
to prepare libraries from the serum of Japanese and other
Asian patients diagnosed as having NANBH demonstrating
antibody to HCV1 antigens; these are believed to be the
most likely candidates for carriers of HCV/J1, HCV/J7, or
related isolates.
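
The idea of extending a known sequence with an overlapping clone can be illustrated with a simple suffix-prefix merge. This is a minimal sketch under stated assumptions: the function name and the `min_overlap` default are hypothetical, exact matching is assumed, and real clone assembly must also handle sequencing errors and both strand orientations.

```python
from typing import Optional


def merge_overlapping(left: str, right: str, min_overlap: int = 8) -> Optional[str]:
    """Join two sequences if the end of `left` matches the start of `right`.

    Tries the longest possible overlap first and returns the merged
    sequence, or None when no overlap of at least `min_overlap` exists.
    """
    longest = min(len(left), len(right))
    for k in range(longest, min_overlap - 1, -1):
        if left[-k:] == right[:k]:
            return left + right[k:]
    return None
```

Repeating the merge with each newly isolated clone corresponds to walking outward from the known cDNA into as yet undefined regions of the genome.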

HCV particles may be isolated from the sera
from individuals with NANBH or from cell cultures by any
of the methods known in the art, including for example,
techniques based on size discrimination such as
sedimentation or exclusion methods, or techniques based
on density such as ultracentrifugation in density
gradients, or precipitation with agents such as
polyethylene glycol, or chromatography on a variety of
materials such as anionic or cationic exchange materials,
and materials which bind due to hydrophobicity.

A preferred method of isolating HCV particles
or antigen is by immunoaffinity columns. Techniques for
immunoaffinity chromatography are known in the art,
including techniques for affixing antibodies to solid
supports so that they retain their immunoselective
activity. The techniques may be those in which the anti-
bodies are adsorbed to the support (see, for example,
Kurstak in ENZYME IMMUNODIAGNOSIS, pages 31-37), as well
as those in which the antibodies are covalently linked to
the support. Generally, the techniques are similar to
those used for covalent linking of antigens to a solid
support, described above. However, spacer groups may be
included in the bifunctional coupling agents so that the
antigen binding site of the antibody remains accessible.
The antibodies may be monoclonal, or polyclonal, and it
may be desirable to purify the antibodies before their
use in the immunoassay.

The general techniques used in extracting the
genome from a virus, preparing and probing a cDNA
library, sequencing clones, constructing expression
vectors, transforming cells, performing immunological
assays such as radioimmunoassays and ELISA assays,
growing cells in culture, and the like are known in the
art and laboratory manuals are available describing these
techniques. However, as a general guide, the following
sets forth some sources currently available for such
procedures, and for materials useful in carrying them
out.

Both prokaryotic and eukaryotic host cells may
be used for expression of desired coding sequences when
appropriate control sequences which are compatible with
the designated host are used. Among prokaryotic hosts,
E. coli is most frequently used. Expression control
sequences for prokaryotes include promoters, optionally
containing operator portions, and ribosome binding sites.
Transfer vectors compatible with prokaryotic hosts are
commonly derived from, for example, pBR322, a plasmid
containing operons conferring ampicillin and tetracycline
resistance, and the various pUC vectors, which also
contain sequences conferring antibiotic resistance mark-
ers. These markers may be used to obtain successful
transformants by selection. Commonly used prokaryotic
control sequences include the beta-lactamase
(penicillinase) and lactose promoter systems (Chang et
al. (1977), Nature 198:1056), the tryptophan (trp) promoter
system (Goeddel et al. (1980) Nucleic Acid Res. 8:4057),
and the lambda-derived P L promoter and N gene ribosome
binding site (Shimatake et al. (1981) Nature 292:128) and
the hybrid tac promoter (De Boer et al. (1983) Proc.
Natl. Acad. Sci. USA 292:128) derived from sequences of
the trp and lac UV5 promoters. The foregoing systems are
particularly compatible with E. coli; if desired, other
prokaryotic hosts such as strains of Bacillus or
Pseudomonas may be used, with corresponding control
sequences.

Eukaryotic hosts include yeast and mammalian
cells in culture systems. Saccharomyces cerevisiae,
Saccharomyces carlsbergensis, Klebsiella and Pichia
pastoris are the most commonly used yeast hosts, and are
convenient fungal hosts. Yeast compatible vectors carry
markers which permit selection of successful
transformants by conferring prototrophy to auxotrophic
mutants or resistance to heavy metals on wild-type
strains. Yeast compatible vectors may employ the 2
micron origin of replication (Broach et al. (1983) Meth.
Enz. 101:307), the combination of CEN3 and ARS1 or other
means for assuring replication, such as sequences which
will result in incorporation of an appropriate fragment
into the host cell genome. Control sequences for yeast
vectors are known in the art and include promoters for
the synthesis of glycolytic enzymes (Hess et al. (1968)
J. Adv. Enzyme Eng. 7:149; Holland et al. (1978) J.
Biol. Chem. 256:1385), including the promoter for 3-
phosphoglycerate kinase (Hitzeman (1980), J. Biol. Chem.
255:2073). Terminators may also be included, such as
those derived from the enolase gene (Holland (1981), J.
Biol. Chem. 256:1385). Particularly useful control
systems are those which comprise the glyceraldehyde-3-
phosphate dehydrogenase (GAPDH) promoter or alcohol
dehydrogenase (ADH) regulatable promoter, terminators
also derived from GAPDH, and, if secretion is desired,
leader sequence from yeast alpha factor. In addition,
the transcriptional regulatory region and the
transcriptional initiation region which are operably
linked may be such that they are not naturally associated
in the wild-type organism. These systems are described
in detail in EPO Pub. No. 120,551; EPO Pub. No. 116,201;
and EPO Pub. No. 164,556, all of which are incorporated
herein by reference.

Mammalian cell lines available as hosts for
expression are known in the art and include many
immortalized cell lines available from the American Type
Culture Collection (ATCC), including HeLa cells, Chinese
hamster ovary (CHO) cells, baby hamster kidney (BHK)
cells, and a number of other cell lines. Suitable
promoters for mammalian cells are also known in the art
and include viral promoters such as that from Simian
Virus 40 (SV40) (Fiers (1978), Nature 273:113), Rous
sarcoma virus (RSV), adenovirus (ADV), and bovine
papilloma virus (BPV). Mammalian cells may also require
terminator sequences and poly A addition sequences;
enhancer sequences which increase expression may also be
included, and sequences which cause amplification of the
gene may also be desirable. These sequences are known in
the art. Vectors suitable for replication in mammalian
cells may include viral replicons, or sequences which
ensure integration of the appropriate sequences encoding
NANBV epitopes into the host genome.

The vaccinia virus system can also be used to
express foreign DNA in mammalian cells. To express
heterologous genes, the foreign DNA is usually inserted
into the thymidine kinase gene of the vaccinia virus and
then infected cells can be selected. This procedure is
known in the art and further information can be found in
these references [Mackett et al. J. Virol. 49:857-864
(1984) and Chapter 7 in DNA Cloning, Vol. 2, IRL Press].

In addition, viral antigens can be expressed in
insect cells by the Baculovirus system. A general guide
to baculovirus expression by Summers and Smith is A Manual
of Methods for Baculovirus Vectors and Insect Cell
Culture Procedures (Texas Agricultural Experiment Station
Bulletin No. 1555). To incorporate the heterologous gene
into the Baculovirus genome the gene is first cloned into
a transfer vector containing some Baculovirus sequences.
This transfer vector, when it is cotransfected with wild-
type virus into insect cells, will recombine with the
wild-type virus. Usually, the transfer vector will be
engineered so that the heterologous gene will disrupt the
wild-type Baculovirus polyhedrin gene. This disruption
enables easy selection of the recombinant virus since the
cells infected with the recombinant virus will appear
phenotypically different from the cells infected with the
wild-type virus. The purified recombinant virus can be
used to infect cells to express the heterologous gene.
The foreign protein can be secreted into the medium if a
signal peptide is linked in frame to the heterologous
gene; otherwise, the protein will be bound in the cell
lysates. For further information, see Smith et al. Mol. &
Cell. Biol. 3:2156-2165 (1983) or Luckow and Summers in
Virology 170:31-39 (1989).

Transformation may be by any known method for
introducing polynucleotides into a host cell, including,
for example, packaging the polynucleotide in a virus and
transducing a host cell with the virus, and by direct
uptake of the polynucleotide. The transformation
procedure used depends upon the host to be transformed.
Bacterial transformation by direct uptake generally
employs treatment with calcium or rubidium chloride
(Cohen (1972), Proc. Natl. Acad. Sci. USA 69:2110;
Maniatis et al. (1982), MOLECULAR CLONING: A LABORATORY
MANUAL (Cold Spring Harbor Press, Cold Spring Harbor,
N.Y.)). Yeast transformation by direct uptake may be
carried out using the method of Hinnen et al. (1978)
Proc. Natl. Acad. Sci. USA 75:1929. Mammalian
transformations by direct uptake may be conducted using
the calcium phosphate precipitation method of Graham and
Van der Eb (1978), Virology 51:546 or the various known
modifications thereof.

Vector construction employs techniques which
are known in the art. Site-specific DNA cleavage is
performed by treating with suitable restriction enzymes
under conditions which generally are specified by the
manufacturer of these commercially available enzymes.
The cleaved fragments may be separated using
polyacrylamide or agarose gel electrophoresis techniques,
according to the general procedures found in Methods in
Enzymology (1980) 65:499-560. Sticky ended cleavage
fragments may be blunt ended using E. coli DNA polymerase
I (Klenow) in the presence of the appropriate
deoxynucleotide triphosphates (dNTPs) present in the
mixture. Treatment with S1 nuclease may also be used,
resulting in the hydrolysis of any single stranded DNA
portions.
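
Site-specific cleavage followed by size separation can be pictured as locating every recognition site and listing the resulting fragments. The sketch below uses the EcoRI site GAATTC (cut after the first G on the strand shown) purely as a familiar example; the choice of enzyme, the function name, and the treatment of the DNA as a linear, single top strand are all illustrative assumptions.

```python
def digest(seq: str, site: str = "GAATTC", cut_offset: int = 1) -> list[str]:
    """Cut a linear sequence at every occurrence of a recognition site.

    `cut_offset` is the position within the site where the top strand is
    cleaved (1 for EcoRI, G^AATTC). Returns fragments in 5'-to-3' order.
    """
    seq = seq.upper()
    cuts = []
    start = seq.find(site)
    while start != -1:
        cuts.append(start + cut_offset)
        start = seq.find(site, start + 1)
    bounds = [0] + cuts + [len(seq)]
    return [seq[a:b] for a, b in zip(bounds, bounds[1:])]
```

Sorting the fragment lengths from such a digest gives the band pattern one would expect to see after gel electrophoresis, ignoring the overhang on the complementary strand.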

Ligations are carried out using standard buffer
and temperature conditions using T4 DNA ligase and ATP;
sticky end ligations require less ATP and less ligase
than blunt end ligations. When vector fragments are used
as part of a ligation mixture, the vector fragment is
often treated with bacterial alkaline phosphatase (BAP)
or calf intestinal alkaline phosphatase to remove the
5'-phosphate and thus prevent religation of the vector;
alternatively, restriction enzyme digestion of unwanted
fragments can be used to prevent ligation. Ligation
mixtures are transformed into suitable cloning hosts,
such as E. coli, and successful transformants selected
by, for example, antibiotic resistance, and screened for
the correct construction.

Synthetic oligonucleotides may be prepared
using an automated oligonucleotide synthesizer as
described by Warner (1984), DNA 2:401. If desired, the
synthetic strands may be labeled with 32P by treatment
with polynucleotide kinase in the presence of 32P-ATP,
using standard conditions for the reaction. DNA
sequences, including those isolated from cDNA libraries,
may be modified by known techniques, including, for
example, site directed mutagenesis, as described by Zoller
(1982), Nucleic Acids Res. 10:6487.

DNA libraries may be probed using the procedure
of Grunstein and Hogness (1975), Proc. Natl. Acad. Sci.
USA 72:3961. Briefly, in this procedure, the DNA to be
probed is immobilized on nitrocellulose filters,
denatured, and prehybridized with a buffer. The
percentage of formamide in the buffer, as well as the
time and temperature conditions of the prehybridization
and subsequent hybridization steps, depend on the
stringency required. Oligomeric probes which require
lower stringency conditions are generally used with low
percentages of formamide, lower temperatures, and longer
hybridization times. Probes containing more than 30 or
40 nucleotides such as those derived from cDNA or genomic
sequences generally employ higher temperatures, e.g.,
about 40-42°C, and a high percentage, e.g., 50%,
formamide. Following prehybridization, 5'-32P-labeled
oligonucleotide probe is added to the buffer, and the
filters are incubated in this mixture under hybridization
conditions. After washing, the treated filters are
subjected to autoradiography to show the location of the
hybridized probe; DNA in corresponding locations on the
original agar plates is used as the source of the desired
DNA.

An enzyme-linked immunosorbent assay (ELISA)
can be used to measure either antigen or antibody
concentrations. This method depends upon conjugation of
an enzyme to either an antigen or an antibody, and uses
the bound enzyme activity as a quantitative label. To
measure antibody, the known antigen is fixed to a solid
phase (e.g., a microplate or plastic cup), incubated with
test serum dilutions, washed, incubated with
anti-immunoglobulin labeled with an enzyme, and washed
again. Enzymes suitable for labeling are known in the
art, and include, for example, horseradish peroxidase.
Enzyme activity bound to the solid phase is measured by
adding the specific substrate, and determining product
formation or substrate utilization colorimetrically. The
enzyme activity bound is a direct function of the amount
of antibody bound.

To measure antigen, a known specific antibody
is fixed to the solid phase, the test material containing
antigen is added, after an incubation the solid phase is
washed, and a second enzyme-labeled antibody is added.
After washing, substrate is added, and enzyme activity is
estimated colorimetrically, and related to antigen
concentration.
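
The step of relating a colorimetric reading to antigen
concentration can be illustrated with a minimal sketch. It
assumes a standard curve of known antigen concentrations
with measured absorbances and uses piecewise-linear
interpolation; the function name, the units, and all
numerical values are hypothetical and are not taken from
the present disclosure.

```python
from bisect import bisect_left


def interpolate_concentration(absorbance, standards):
    """Estimate concentration from absorbance by linear interpolation.

    standards: (concentration, absorbance) pairs from a hypothetical
    standard curve; absorbance must increase with concentration.
    """
    points = sorted(standards, key=lambda pair: pair[1])
    readings = [a for _, a in points]
    if not readings[0] <= absorbance <= readings[-1]:
        raise ValueError("reading lies outside the standard curve")
    i = bisect_left(readings, absorbance)
    if readings[i] == absorbance:
        return points[i][0]
    (c0, a0), (c1, a1) = points[i - 1], points[i]
    # Linear interpolation between the two bracketing standards.
    return c0 + (c1 - c0) * (absorbance - a0) / (a1 - a0)


# Hypothetical standards: (concentration in ng/ml, absorbance).
STANDARDS = [(0.0, 0.05), (10.0, 0.25), (20.0, 0.45), (40.0, 0.85)]
estimate = interpolate_concentration(0.35, STANDARDS)
```

Readings outside the range of the standards are rejected
rather than extrapolated, since a colorimetric response is
not generally linear beyond the calibrated range.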

Examples

I

This example describes the cloning of the HCV/J1 and
HCV/J7 nucleotide sequences.

Both blood samples which were used as a source
of HCV virions were found to be positive in an anti-HCV
antibody assay. The HCV isolates from these samples were
named HCV/J1 and HCV/J7. The infectivity of the blood
sample containing the J1 isolate was confirmed by a
prospective study of blood transfusion recipients. Dr.
Tohru Katayama from the Department of Surgery at the
National Tokyo Chest Hospital collected blood from
patients who had contracted post-transfusion non-A, non-B
hepatitis. He also collected blood samples from the
respective blood donors of these patients. Next, these
samples were assayed for antibodies to the C100-3 HCV1
antigen (EPO Pub. No. 318,216), and blood from one of the
donors was found to be positive.

Isolation of the RNA from the blood samples
began by pelleting virions in the blood sample by
ultracentrifugation [Bradley, D.W., McCaustland, K.A.,
Cook, E.H., Schable, C.A., Ebert, J.W. and Maynard, J.E.
(1985) Gastroenterology 88, 773-779]. RNA was then
extracted from the pellet by the guanidinium/cesium
chloride method [Maniatis, T., Fritsch, E.F., and
Sambrook, J. (1982) "Molecular Cloning: A Laboratory
Manual", Cold Spring Harbor Laboratory, Cold Spring
Harbor] and further purified by phenol/chloroform
extraction in the presence of urea [Berk, A.J., Lee, F.,
Harrison, T., Williams, J. and Sharp, P.A. (1979) Cell
17, 935-944].

Five pairs of synthetic oligonucleotide primers
were designed from the C/E, E, E/NS1, NS3, and NS5
domains of the nucleotide sequence of HCV1 to isolate
fragments from the J1 and J7 genomes. The first set of
primers was used to isolate the sequence from the core and
some of the envelope domain. The second set of primers
was used to isolate the sequences in the envelope domain.
The third set of primers was used to isolate a fragment
which overlapped the putative envelope and non-structural
one, NS1, domains. The fourth and fifth sets of primers
were used to isolate fragments from non-structural domains
three and five, NS3 and NS5. The sequences for the
various primers are shown below:

The sequences of the primers for the C/E region were:

21S 5' CGTGCCCCCGCAAGACTGCT 3'

J80A 5' CCGTCCTCCAGAACCCGGAC 3'

The sequences of the primers for the E region were:

71S 5' GCCGACCTCATGGGGTACAT 3'

J132A 5' AACTGCGACACCACTAAGGC 3'

The sequences of the primers for the E/NS1 region were:

127S 5' TGGCATGGGATATGATGATG 3'

166A 5' TTGAACTTGTGGTGATAGAA 3'

The sequences of the primers for the NS3 region were:

464S 5' GGCTATACCGGCGACTTCGA 3'

526A 5' GACATGCATGTCATGATGTA 3'

The sequences of the primers for the NS5 region were:

870S 5' GCTGGAAAGAGGGTCTACTA 3'

917A 5' GTTCTTACTGCCCAGTTGAA 3'
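
The primer pairs listed above can be checked for length and
base composition with a short sketch. The primer names and
sequences are those given in the text; the helper functions
and the dictionary layout are illustrative assumptions, not
part of the original disclosure.

```python
# Translation table mapping each base to its complement.
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Primer sequences as listed in the text, written 5' to 3'.
PRIMERS = {
    "21S": "CGTGCCCCCGCAAGACTGCT",
    "J80A": "CCGTCCTCCAGAACCCGGAC",
    "71S": "GCCGACCTCATGGGGTACAT",
    "J132A": "AACTGCGACACCACTAAGGC",
    "127S": "TGGCATGGGATATGATGATG",
    "166A": "TTGAACTTGTGGTGATAGAA",
    "464S": "GGCTATACCGGCGACTTCGA",
    "526A": "GACATGCATGTCATGATGTA",
    "870S": "GCTGGAAAGAGGGTCTACTA",
    "917A": "GTTCTTACTGCCCAGTTGAA",
}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.translate(COMPLEMENT)[::-1]


def gc_fraction(seq):
    """Return the fraction of G and C bases in the sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


for name, seq in PRIMERS.items():
    print(name, len(seq), round(gc_fraction(seq), 2), reverse_complement(seq))
```

The reverse complement of an antisense primer gives the
corresponding sense-strand sequence, which is the form in
which a primer site would appear in a sequence listed 5' to
3' on the coding strand.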


1 µg of the antisense primers, 166A, 526A, or
917A, was added to 10 units of reverse transcriptase
(Biorad) to synthesize cDNA fragments from the isolated
RNA as the template. The cDNA fragments were then
amplified by a standard polymerase chain reaction [Saiki,
R.K., Scharf, S., Faloona, F., Mullis, K.B., Horn, G.T.,
Erlich, H.A., and Arnheim, N. (1985) Science 230, 1350-
1354] after 1 µg of the appropriate sense primer, 21S,
71S, 127S, 464S or 870S, was added.

The cDNA fragments amplified by the PCR method
were gel isolated and cloned by blunt-end ligation into
the SmaI site of pUC119 [Vieira, J. and Messing, J.
(1987) Methods in Enzymology 152, 3-11] or into the SnaBI
site of charomid SB, a derivative of the cloning vector
charomid 9-42 [Saito, I. and Stark, G. (1986) Proc. Natl.
Acad. Sci. USA 83: 8664-8668]. Clones which contain the
fragments of the five viral domains were successfully
constructed.

For the Japanese isolates, J1 and J7, three
independent clones from each region were sequenced by the
dideoxy chain termination method.

Sequence from all regions except C/E has been
isolated from the J1 isolate. Sequence from only the C/E
region has been isolated from the J7 isolate.
Surprisingly, fragments isolated from both isolates are
neither longer nor shorter than what would be predicted
from the HCV1 genome. However, there is heterogeneity
between clones containing sequence from the same region.
Consequently, a consensus sequence was constructed for
each of the domains, C/E, E, E/NS1, NS3 and NS5 [...].
One explanation for the heterogeneity is [...] which occur
randomly during the PCR amplification [Saiki, R.K.,
Scharf, S., Faloona, F., Mullis, K.B., Horn, G.T.,
Erlich, H.A., and Arnheim, N. (1985) Science 230,
1350-1354]. Another explanation is that more than one
virus genome is present in the plasma of a single healthy
carrier and that these genomes are heterogeneous at the
nucleotide level.

To clarify this point, it was determined how
many of the nucleotide differences result in amino acid
changes, using the sequence from the NS3 domain of the J1
isolate as an example. Out of [...] nucleotide
differences, [...] fall in the third position of the
amino acid codon and do not change the amino acid
sequence. [...] of the remaining [...] changes [...]
position of the amino acid codon and [...] of threonine to
alanine and [...] small, [...] amino acid residues.
Similarly, when analysing the nucleotide differences in
other domains, many silent and conserved mutations are
found. These results suggest that nucleotide sequences of
the HCV genomes in the plasma of a single healthy donor
are heterogeneous at the nucleotide level.
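
The kind of analysis described above, in which nucleotide
differences are sorted into those that leave the encoded
amino acid unchanged and those that replace it, can be
sketched as follows. The sketch assumes two in-frame coding
sequences of equal length and the standard genetic code;
the function name and the two short example sequences are
hypothetical and are not taken from the J1 clones.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def classify_differences(ref, alt):
    """Compare two in-frame coding sequences codon by codon.

    Returns one tuple per differing codon:
    (codon number, changed codon positions, ref residue, alt residue, kind)
    where kind is "silent" or "replacement".
    """
    if len(ref) != len(alt) or len(ref) % 3 != 0:
        raise ValueError("sequences must be in frame and of equal length")
    results = []
    for start in range(0, len(ref), 3):
        ref_codon, alt_codon = ref[start:start + 3], alt[start:start + 3]
        if ref_codon == alt_codon:
            continue
        positions = [p + 1 for p in range(3) if ref_codon[p] != alt_codon[p]]
        ref_aa, alt_aa = CODON_TABLE[ref_codon], CODON_TABLE[alt_codon]
        kind = "silent" if ref_aa == alt_aa else "replacement"
        results.append((start // 3 + 1, positions, ref_aa, alt_aa, kind))
    return results


# Hypothetical example: a third-position change that is silent (Thr to Thr)
# and a first-position change that replaces threonine with alanine.
differences = classify_differences("ACCACGGGT", "ACTGCGGGT")
```

Whether a replacement is also conservative (for example,
one small residue exchanged for another) depends on a
chosen grouping of amino acids, which this sketch
deliberately leaves out.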

In addition, once the consensus sequences for
each of the fragments were compiled, each sequence was
compared to the HCV1 isolate in Figures 6 through 10. In
Figure 6 the fragment from the C/E region of the J7
isolate shows a 92.8%, 512/552, nucleotide and 97.4%,
150/154, amino acid homology to the HCV1 isolate. The
fragment from the E domain of J1 shows a lower
nucleotide and amino acid homology to HCV1 in Figure 7 of
76.2% and 82.9%, respectively. The fragment from the J1
isolate which overlaps the envelope and non-structural
one domains shows the lowest homology to HCV1, as seen in
Figure 8, where the J1 isolate has a 71.5% nucleotide
homology and a 73.5% amino acid homology to HCV1. Figure
9 shows a comparison of the fragment from the NS3 domain
of J1 to HCV1. The homology between the nucleotide
sequences is 79.8%, while the amino acid homology between
the isolates is quite high, 92.2% or 179/194 amino acids.
Figure 10 shows the homology between the NS5 sequences
from J1 and HCV1. The sequences have a 84.3% nucleotide
and 88.7% amino acid homology.
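
The homology figures quoted above are ratios of matching
positions to compared positions. A minimal sketch of that
arithmetic follows; the function names are illustrative
assumptions, and the position-by-position comparison
presumes sequences that are already aligned and of equal
length.

```python
def count_matches(seq_a, seq_b):
    """Count identical positions in two aligned, equal-length sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    return sum(a == b for a, b in zip(seq_a, seq_b))


def percent_identity(matches, length):
    """Return matches/length as a percentage rounded to one decimal place."""
    return round(100.0 * matches / length, 1)


# Ratios quoted in the text for the C/E fragment of J7 versus HCV1.
ce_nucleotide = percent_identity(512, 552)   # 92.8
ce_amino_acid = percent_identity(150, 154)   # 97.4
```

Applied to the NS3 figure, 179/194 evaluates to 92.3 when
rounded to one decimal place; the 92.2% given in the text
corresponds to truncation rather than rounding.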

The vectors described in the examples above
were deposited with the Patent Microorganism Depository,
Fermentation Institute, Agency of Industrial Science and
Technology at 1-3, Higashi 1-chome, Tsukuba-shi,
Ibaragi-ken 305, Japan, and will be maintained under the
provisions of the Budapest Treaty. The accession numbers
and dates of the deposit are listed below, on page 68.

An HCV/J1 clone, J1-1519, was isolated using
essentially the techniques described above. However,
the primers used in the isolation were J159S and 199A.
The sequences of the oligomeric primers J159S and 199A,
which follow, were based upon those in J1-1216 and in
HCV1.

J159S 5' ACT GCC CTG AAC TGC AAT GA 3'

199A 5' AAT CCA GTT GAG TTC ATC CA 3'

Clone J1-1519 is comprised of an HCV cDNA
sequence of 367 nucleotides which spans most of the 5'-
half of the NS1 region and which overlaps the E-region
clone, J1-1216, by 31 nucleotides. Three independent
clones spanning this region were sequenced; the sequences
in this region obtained from the three clones were
identical. The sequence of the HCV cDNA in J1-1216
(shown in the figure as J1) and the amino acids encoded
therein (shown above the nucleotide sequence) are shown
in Figure 13. Figure 13 also shows the sequence
differences between J1-1216 and the comparable region of
the prototype HCV1 cDNA (indicated in the figure as PT),
and the resulting changes in the encoded amino acids.
The homology between the J1-1216 and HCV1 cDNA is
approximately 70% at the nucleotide level, and about 75%
at the amino acid level.

A composite of the sequences from the putative
core to NS1 region of the J1 isolate is shown in Figure
14; also shown in the figure are the amino acids encoded
in the J1 sequence. The variation from the HCV1
prototype sequence is shown in the line below the J1
nucleotide sequence; the dashed lines indicate homologous
sequences. The nonhomologous amino acid encoded in the
HCV1 prototype sequence is shown below the HCV1
nucleotide sequence.

Cloned material containing the J1/1519 HCV cDNA
(pS1-1519) has been maintained in DH5α, and deposited
with the Patent Microorganism Depository.






WO 91/04262 



59 



PCT/US90/05242



IS 

Several regions of the J1 isolate,
including the C200-C100 region from the putative NS3-NS4
region (which encompasses the region encoding the 5-1-1
polypeptide in HCV1 (see EPO Pub. No. 318,216)), and the
putative NS1-E region, were amplified using the PCR
method. The C200-C100 region includes nucleotides 3799
to 5321 of the prototype HCV1. RNA was extracted as
described above, except that extraction was with
guanidinium thiocyanate in the presence of Proteinase K
and sodium dodecylsulfate (SDS) (Maniatis (1982), supra).
The RNA was transcribed into HCV cDNA by incubation in a
25 µl reaction comprised of 1 mM of each primer, 40 units
of RNase inhibitor (RNASIN), 5 units of AMV reverse
transcriptase, and salts and buffer necessary for the
reaction. Amplification of a segment of the HCV cDNA
from the designated region was performed utilizing pairs
of synthetic oligomer 16-mer primers. PCR amplification
was accomplished in three rounds (PCR I, PCR II, and
PCR III). The second and third rounds of PCR
amplification (PCR II and PCR III) utilized different
sets of PCR primers; the first PCR reaction was diluted
10-fold and multiple rounds of PCR amplification were
carried out with the new primers, so that ultimately up
to 50% of the products of the first PCR reaction (PCR I)
were reamplified. The primers used for the amplification
of the regions were the following. These primers, with
the exception of J1C200-3 which was derived from the J1
isolate sequence, were derived from the prototype HCV1
sequence.

Primers for amplification of the "5-1-1" region
from NS3-NS4.

PCR I

511/16A (sense, derived from nucleotides starting at
number 1528 of HCV1)
5' AAC AGG CTG CGT GGT C 3'

(anti-sense, derived from nucleotides ending at
5260 of HCV1)
5' AGT TGG TCT GGA CAG C 3'

PCR II

511/35A (sense, the HCV portion derived from nucleotides
starting at number 5057 of HCV1; the restriction enzyme
site is underlined)
5' [illegible] TCG TCT TGT CCG GGA AGO CGG CAA TC 3'

[illegible] (anti-sense, the HCV portion derived from
nucleotides ending at number 5233 of HCV1; the
restriction enzyme site is underlined)
5' CCT CTG CCT GAC GGG ACG CGG TCT GC 3'

PCR III

511/35A (see supra)

[illegible] (antisense, derived from nucleotides ending at
number 5804 of HCV1)
5' GTA GTG CGT GGG GGA AAC AT 3'
Primers for amplification of the "NS1" region

PCR I

[illegible] (sense, the HCV portion derived from nucleotides
starting at number 953 of HCV1; the restriction enzyme
site is underlined)
5' CTTAGAATTC TGG CAT GGG ATA TGA TGA TG 3'

[illegible] (sense, the HCV portion derived from nucleotides
starting at number 1087 of HCV1; the restriction enzyme
site is underlined)
5' CTTAGAATTC TCC ATG GTG GGG AAC TGG GC 3'

(anti-sense, the HCV portion derived from
nucleotides ending at 1995 of HCV1, the restriction
enzyme site is underlined)
5' [illegible] TAA CGG GCT GAG CTC GGA 3'

J1rc13 (anti-sense, the HCV portion derived from
nucleotides ending at 1941 of HCV1, the restriction
enzyme site is underlined)
5' CTTAGAATTC CGT CCA GTT GCA GGC AGC TTC 3'

PCR II

J1rc13 (see supra)

J1IZ-1 (sense, the HCV portion is derived from
nucleotides starting at number 1641 of HCV1, the
restriction enzyme site is underlined)
5' CTTGAATTC CAA CTG GTT CGG CTG TAC A 3'

J1IZ-2 (sense, the HCV portion is derived from
nucleotides starting at number 1596 of HCV1, the
restriction enzyme site is underlined)
5' TGA GAC GGA CGT GCT GCT CCT 3'
Primers for the C200-C100 region of the "NS3-NS4" region

PCR I

J1C200-1 (sense, derived from nucleotides starting at
number 3478 of HCV1)
5' TCC TAC TTG AAA GGC TC 3'

J1C200-3 (anti-sense, derived from nucleotides ending at
number 4402 of HCV1)
5' GGA TCC AAG CTG AAA TCG AC 3'

J1rc52 (anti-sense, the HCV portion derived from
nucleotides ending at 5853 of HCV1, the restriction
enzyme site is underlined)
5' CTTAGAATTC GAG GCT GCT GAG ATA GGC AGT 3'

511/16A (see above).

PCR II

J1C200-2 (sense, the HCV portion derived from nucleotides
starting at number 3557 of HCV1, the restriction enzyme
site is underlined)
5' CTTGAATTC CCC GTG GAG TGG CTA AGG CGG TGG ACT 3'

J1C200-4 (anti-sense, the HCV portion derived from
nucleotides ending at 4346 of HCV1, the restriction
enzyme site is underlined)
5' CTTGAATTC TCG AAG TCG CCG GTA TAG CCG GTC ATG 3'

511/35A (see above)

(anti-sense, the HCV portion derived from
nucleotides ending at 5826 of HCV1, the restriction
enzyme site is underlined)
5' [illegible] GGC AGC TGC ATC GCT CTC CGG CAC 3'

The amplified HCV cDNAs were sequenced directly
without cloning, and/or were cloned. Sequencing was
accomplished using an asymmetric PCR technique
essentially as described in Shyamala and Ames, J.
Bacteriology 171:1602 (1989). In this technique,
amplification of the cDNA is carried out with a limiting
concentration of one of the primers (usually in a ratio
[...]), resulting in preferential amplification
of one strand. The preferentially amplified strand is
then sequenced by the dideoxy chain termination method.

The primers used for asymmetric sequencing by
the PCR method were the following. For the NS1 region:
[...] and J1rc13 (sequenced with both [...];
confirmed on both strands). For the NS3-NS4 region, the
C200-C100 C-terminal region, and the 5-1-1 region: [...]
and J1C200-7 (for the N-terminal region of C200);
J1C200-4 and [...] (for the C200-C100 C-terminal
region); and 511/35A and hep 4 (for the 5-1-1 region).
The sequences for J1C200-3, J1C200-4, and [...] were
[...]; the sequences of [...] and
J1C200-7 are the following.

hep 4 (derived from nucleotides starting at number 5415
[...])

[illegible] (the HCV portion derived from nucleotides
starting at number 3875 of HCV1, the restriction enzyme
site is underlined)
5' [illegible] CGT ACT CCA CCT ACG GCA AGT TCC TT 3'

J1C200-7 (the HCV portion derived from nucleotides
starting at number 3946 of HCV1, the restriction enzyme
site is underlined)
5' CTTGAATTC GTG GCA TCC GTG GAG TGG CAC TCG TC 3'

The sequences obtained by asymmetric sequencing
of the "NS1" region, the C200-C100 region, and the 5-1-1
region are shown in Figure 15, and Figure 16,
respectively. In the figures, the amino acids encoded in
the J1 sequence are shown above the J1 nucleotide
sequence. The differences between the J1 sequence and
the HCV1 prototype nucleotide sequence are shown below the
J1 sequence (the dashes indicate homologous nucleotides
in both sequences). The encoded amino acids which differ
in the HCV1 prototype sequence are shown below the HCV1
nucleotide sequence.

HCV cDNAs from the NS1 region, the C200-C100
region, and the 5-1-1 region were cloned. A 300 bp and a
230 bp fragment from the putative NS1 region were cloned
into a derivative of the commercially available vector,
pGEM-3Z, in host HB101, and deposited with the ATCC as
AW-300bp. The derivative vectors maintain the original
pGEM-3Z polylinkers, an intact Amp r gene, and the genes
required for replication in E. coli. The HCV cDNA
fragments may be removed with SacI and XbaI. HCV cDNAs
containing 770 bp N-terminal fragments of C200 were
cloned into pMIE in HB101; 12 clones were pooled and
deposited with the ATCC as AW-770bp-N; the HCV cDNA may
be removed from the vector with HaeII. The resultant
HaeII fragment will contain vector DNA of 300 bp and 250
bp at the 5' and 3' ends, respectively. HCV cDNAs
containing 700 bp C-terminal fragments of C200
(AW-700bp-C) were cloned into M13mp10 and maintained in
host DH5α-F'; cloning was into the vector polylinker site.
The resultant phage were pooled, and deposited with the
ATCC on September 11, 1990 as AW-700bp-N or AW-700bp-C.
HCV cDNA from J1 equivalent to the 5-1-1 region of HCV1
was cloned into the mp19 RI site, and maintained in [...].
Several M13 phage supernatants from this cloning were
pooled and deposited with the ATCC as J1 5-1-1 on
[...] 1990. The HCV cDNAs may be obtained from
the phage by treatment with EcoRI. Accession numbers for
J1 5-1-1 and AW-700bp-N or AW-700bp-C may be obtained by
telephoning the ATCC at (301) 881-2600.

The above-described cloned material was
deposited with the American Type Culture Collection [...].

An HCV cDNA library containing sequences of
the putative "NS1" region of the J1 isolate was created
by directional cloning in λ-gt22. The "NS1" region
extends from about nucleotide [...] to about nucleotide
2730 using the numbering system of the HCV1 prototype
nucleic acid sequence, where nucleotide 1 is the first
nucleotide of the initiating methionine codon for the
putative polyprotein. The cloning was accomplished using
essentially the method described by Han and Rutter in
GENETIC ENGINEERING, Vol. 10 (J.K. Setlow, Ed., Plenum
Publishing [...]), except that the primers for the
[...] were JHC67 and JHC68, respectively, and the source
of RNA was the J1 plasma. In this method, the RNA is
extracted with guanidinium thiocyanate at a low
temperature. The RNA is then converted to full length
cDNA, which is cloned in a defined orientation relative
to the promoter in phage. Using this method, the HCV
cDNAs to J1 RNA were inserted into the NotI site of
λ-gt22 [...].

The sequence of a region of "NS1" downstream
from the region shown in Figure 14, but which overlaps
the region by about 20 nucleotides, was determined using
the asymmetric sequencing technique described above, but
substituting as primers for PCR amplification, ALX 61 and
ALX 62. The resulting sequence is shown in Figure 17.
(It should be noted that the PCR amplification was of a
region from about nucleotide 1930 to about nucleotide
2340; this region is also encompassed in the sequence
shown in Figure 15). The sequences of the primers and
probes used to obtain the HCV cDNA library in λ-gt22, and
to sequence the portion of the "NS1" region were the
following.
JHC 67

5' GACGC GGCCG CCTCC GTGTC CAGCG CGT 3'

JHC 68

5' CGTGC GGCCG CAAGA CTGCT AGCCG AGGT 3'

ALX 61

5' ACCTG CCACT GTGTA GTGGT CAGCA GTAAC 3'

ALX 62

5' ACGGA CGTCT TCGTC CTTAACAATA CCAGG 3'

ALX 54

5' GAACT TTGCG ATCTG GAAGACAGGG ACAGG 3'
A 400 bp fragment of J1 HCV cDNA derived from
the sequenced region was cloned into pGEM3Z and
maintained in HB101; the HCV cDNA may be removed from the
vector with SacI and XbaI. Host cells transformed with
the vector (JH-400bp) have been deposited with the ATCC.

A pooled cDNA library was created from the J1
serum; the pooled library spans the J1 genome and is
identified as HCV-J1 λ gt22. The pooled cDNA library was
created by pooling aliquots of 11 individual cDNA
libraries, which had been prepared using the directional
cloning technique described above, except that the
libraries were created from primers which were designed
to yield HCV cDNAs which spanned the genome. The primers
were derived from the sequence of HCV1, and included JHC
67 and JHC 68. The HCV cDNAs were inserted into the NotI
site of λ-gt22. The pooled cDNA library, HCV-J1 λ gt22,
has been deposited with the ATCC.

The sequence of a region of the polynucleotide
upstream of that shown in Figure 14 was determined. This
region begins at nucleotide -267 with respect to HCV1
(see Figure 12) and extends for 560 nucleotides.
Sequencing was accomplished by preparing HCV cDNA from
RNA extracted from J1 serum, and amplifying the HCV cDNA
using the PCR method.

RNA was extracted from 100 µl of serum
following treatment with proteinase K and sodium
dodecylsulfate (SDS). The samples were extracted with
phenol-chloroform, and the RNA precipitated with ethanol.

HCV cDNA from the J1 isolate was prepared by
denaturing the precipitated RNA with 0.01M MeHgOH; after
ten minutes at room temperature, 2-mercaptoethanol was
added to sequester the mercury ions. Immediately, the
mix for the first strand of cDNA synthesis was added, and
incubation was continued for 1 hr at 37°C. The
conditions for the synthesis of the anti-sense strand
were the following: 50 mM Tris HCl, pH 8.3, 75 mM KCl, 3
mM MgCl2, 10 mM dithiothreitol, 500 µM each
deoxynucleotide triphosphate, 250 pmol specific antisense
cDNA primer r25, 250 units MMLV reverse transcriptase.
In order to synthesize the second strand (sense), the
synthesis reaction components were added, and incubated
for one hour at 14°C. The components for the second
strand reaction were as follows: 14 mM Tris HCl, pH 8.3,
68 mM KCl, 7.5 mM ammonium sulfate, 3.5 mM MgCl2, 2.8 mM
dithiothreitol, 25 units DNA polymerase I, and one unit
RNase H. The reactions were terminated by heating the
samples to 95°C for 10 minutes, followed by cooling on
ice.

The HCV cDNA was amplified by two rounds of
PCR. The first round was accomplished using 20 µl of the
cDNA mix. The conditions for the PCR reaction were as
follows: 10 mM Tris HCl, pH 8.3, 50 mM KCl, 1.5 mM MgCl2,
0.002% gelatin, 200 mM each of the deoxynucleotide
triphosphates, and 2.5 units Amplitaq. The PCR thermal
cycle was as follows: 94°C one minute, 50°C one minute,
72°C one minute, repeated 40 times followed by seven
minutes at 72°C. The second round of PCR was
accomplished using nested primers (i.e., primers which
bound to an internal region of the first round of PCR
amplified product) to increase the specificity of the PCR
products. One percent of the first PCR reaction was
amplified essentially as the first round, except that the
primers were substituted, and the second step in the PCR
reaction was at 60°C instead of 50°C. The primers used
for the first round of PCR were ALX90 and r14. The
primers used for the second round of PCR were r14 and
p14.
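
The two thermal cycling programs described above differ
only in the temperature of the second step. A minimal
sketch that writes them out as data follows; the class and
function names are illustrative assumptions, and the total
counts only the programmed hold times, not the time an
instrument spends ramping between temperatures.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    temp_c: int      # hold temperature in degrees Celsius
    minutes: float   # hold time in minutes


def pcr_program(second_step_c, cycles=40):
    """Build the program from the text: 94, second step, 72 (1 min each),
    repeated `cycles` times, followed by seven minutes at 72."""
    cycle = [Step(94, 1), Step(second_step_c, 1), Step(72, 1)]
    return cycle * cycles + [Step(72, 7)]


def total_minutes(program):
    """Sum of programmed hold times, excluding ramp times."""
    return sum(step.minutes for step in program)


first_round = pcr_program(50)    # second step at 50 degrees C
second_round = pcr_program(60)   # nested round, second step at 60 degrees C
```

Each program holds for 127 minutes in total (40 cycles of
three one-minute steps plus the final seven minutes).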

The sequences of the primers for the synthesis
of HCV cDNA and for the PCR method were the following.

r25

5' ACC TTA CCC AAA TTG CGC GAC CTA 3'

ALX90

5' CCA TGA ATC ACT CCC CTG TGA GGA ACT A 3'

r14

5' GGG CCC CCAG CTA GGC CGA GA 3'

p14

5' AAC TAC TGT CTT CAC GCA GAA AGC 3'
The PCR products were gel purified, the
material which migrated as having about 615 bp was
isolated, and sequenced by a modification of the Sanger
dideoxy chain termination method, using 32P-ATP as label.
In the modified method, the sequence replication was
primed using [...] primers; the double stranded DNA
was melted for 3 minutes at 95°C prior to
replication, and the synthesis of labeled dideoxy
terminated polynucleotides was catalyzed by Bst
polymerase (obtained from BioRad Corp.), according to the
manufacturer's directions. The sequencing was performed
using 500 ng to 1 µg of PCR product per sequencing
reaction.

The primers P32 (sense) and R31 (antisense)
were derived from nucleotides -137 to -115 and from
nucleotides [illegible] to 173, respectively, of the HCV1
sequence. The sequences of the primers are the
following.

P32 primer

5' AAC CCG CTC AAT GCC TGG AGA TT 3'

R31 primer

5' GGC CGX CGA GCC TTG GGG AT 3'

where X = A or G

The sequence of the region in the J1 isolate
which encompasses the 5'-untranslated region as well as a
part of the region of the putative "Core" is shown in
Figure 18. In the figure, amino acids encoded in the J1
sequence are shown above the nucleotide sequence. The
sequence of the prototype HCV1 is shown below the J1
sequence; the dashes indicate sequence homology with [...].
The differing amino acids encoded in the HCV1 sequence
are shown below the HCV1 sequence.

An HCV cDNA fragment which is representative
of the 600 bp J1 sequence described above (TC 600bp) was
cloned into pGEM3Z and maintained in host HB101; the HCV
cDNA fragment may be removed with SacI and XbaI. This
material is on deposit with the ATCC.



Patent Microorganism Depository; deposited under Budapest
Treaty terms.

Deposited Materials          Accession Number   Deposit Date

E. coli DH5/pS1-8791a        BP-2593            9/15/1989
(This clone contains 427 bp of the NS5 domain of J1)

E. coli HB101/pU1-1216c      BP-2594            9/15/1989
(This clone contains 351 bp of the E/NS1 domains of J1)

E. coli HB101/pU1-4652d      BP-2595            9/15/1989
(This clone contains 583 bp of the NS3 domain of J1)

E. coli DH5α/pS1-713c        BP-2637            11/1/1989
(This clone contains 580 bp of the E domain of J1)

E. coli DH5α/pS7-28c         BP-2638            11/1/1989
(This clone contains 552 bp of the C/E domain of J7)

E. coli DH5α/pS1-1519        BP3081             8/30/90

The following vectors described in the Examples
were deposited with the American Type Culture Collection
(ATCC), 12301 Parklawn Dr., Rockville, Maryland 20852,
and have been assigned the following Accession Numbers.
The deposits were made under the terms of the Budapest
Treaty.

Deposited Materials                   Accession Number   Deposit Date

TC-600bp (in
E. coli HB101/pGEM3Z)                 68393              9/11/90

JH-400bp (in
E. coli HB101/pGEM3Z)                 68394              9/11/90

AW-300bp (in
E. coli HB101/pGEM3Z)                 68392              9/11/90

AW-770bp-N (in
E. coli HB101/pMIE)                   68395              9/11/90

AW-700bp-C or AW-700bp-N (in
E. coli DH5α-F'/M13mp10)

J1 5-1-1 (in
E. coli DH5α-F'/M13mp10)

HCV-J1 λ gt22                         40884              9/6/90

These deposits are provided for the convenience of those
skilled in the art. These deposits are neither an
admission that such deposits are required to practice the
present invention nor that equivalent embodiments are not
within the skill of the art in view of the present
disclosure. The public availability of these deposits is
not a grant of a license to make, use or sell the
deposited materials under this or any other patent. The
nucleic acid sequences of the deposited materials are
incorporated into the present disclosure by reference, and
are controlling if in conflict with any sequences
described herein.

While the present invention has been described
by way of specific examples for the benefit of those in the
field, the scope of the invention is not limited, as
additional embodiments will be apparent to those of skill
in the art from the present disclosure.

CLAIMS

1. A DNA molecule comprising a nucleotide
sequence of at least 15 bp from an HCV isolate
substantially homologous to an isolate selected from the
group J1 and J7, wherein said nucleotide sequence is
distinct from the nucleotide sequence of HCV isolate
HCV1.

2. A DNA molecule comprising a nucleotide
sequence of at least 15 bp encoding an amino acid
sequence from the HCV isolate J1 or J7 wherein the J1 or
J7 amino acid sequence is distinct from the amino acid
sequence of HCV isolate HCV1.

3. A DNA molecule according to claim 2 wherein
the J1 or J7 amino acid sequence comprises a
substantially complete viral polypeptide.

4. A DNA molecule according to claim 2 wherein
the J7 amino acid sequence is amino acid 1 to amino acid
115.

5. A DNA molecule according to claim 1 wherein
the J1 amino acid sequence is from amino acid 116 to
amino acid 350.

6. A DNA molecule according to claim 2 wherein
the J1 amino acid sequence is from amino acid 351 to
amino acid 651.

7. A DNA molecule according to claim 2 wherein
the J1 amino acid sequence is from amino acid 1007 to
amino acid 1650.

8. A DNA molecule according to claim 2 wherein
the J1 amino acid sequence is from amino acid 2100 to the
end of the coding sequence.



9. A purified polypeptide comprising an amino
acid sequence from an HCV isolate substantially
homologous to an isolate selected from the group
consisting of J1 and J7 wherein the amino acid sequence
is distinct from the sequence of the polypeptides encoded
by the HCV isolate HCV1.

10. A purified polypeptide according to claim
9 wherein the J1 or J7 amino acid sequence comprises an
epitope that is not immunologically cross-reactive with
any HCV1 epitope.

11. A purified polypeptide according to claim
9 wherein the J7 amino acid sequence is from amino acid 1
to amino acid 115.

12. A purified polypeptide according to claim
9 wherein the J1 amino acid sequence is from amino acid
116 to amino acid 350.

13. A purified polypeptide according to claim
9 wherein the J1 amino acid sequence is from amino acid
351 to amino acid 651.



14. A purified polypeptide according to claim
9 wherein the J1 amino acid sequence is from amino acid
1007 to amino acid 1650.

15. A purified polypeptide according to claim
9 wherein the J1 amino acid sequence is from amino acid
2100 to the end of the coding sequence.

16. A polypeptide comprising an amino acid
sequence from an HCV isolate selected from the group
consisting of J1 and J7 wherein the J1 or J7 amino acid
sequence is distinct from the amino acid sequence of HCV
isolate HCV1 and the polypeptide is immobilized on a
solid support.

17. An immunoassay for detecting the presence
of anti-HCV antibodies in a test sample comprising:

(a) incubating the test sample under
conditions that allow the formation of antigen-
antibody complexes with an antigenic
polypeptide comprising an amino acid sequence
from an HCV isolate selected from the group
consisting of J1 and J7, wherein the amino acid
sequence is distinct from the amino acid
sequence of HCV isolate HCV1; and (b) the J1 or
J7 amino acid sequence is not immunologically
cross-reactive with HCV1; and

(b) detecting an antigen-antibody complex
comprising the antigenic polypeptide.



18. An immunoassay according to claim 17
wherein the J1 amino acid sequence is from a viral domain
selected from the group consisting of amino acid sequence
from amino acid 116 to amino acid 350, from amino acid
351 to amino acid 651, from amino acid 1007 to amino acid
1650, and from amino acid 2100 to the end of the coding sequence.

19. An immunoassay according to claim 17
wherein the J1 amino acid sequence is from amino acid 1
to amino acid 115.

20. An immunoassay according to claim 17
wherein the test sample comprises human blood or a
fraction thereof.

21. A composition comprising anti-HCV
antibodies that bind an HCV epitope substantially free of
antibodies that do not bind an HCV epitope, wherein:

(a) the HCV epitope comprises an amino acid
sequence from an HCV isolate selected from the
group consisting of J1 and J7;

(b) the J1 or J7 amino acid sequence is
distinct from the amino acid sequence of HCV
isolate HCV1; and

(c) the J1 or J7 amino acid sequence is not
immunologically cross-reactive with HCV1.

22. A composition according to claim 21
wherein the anti-HCV antibodies are polyclonal.

23. A composition according to claim 21
wherein the anti-HCV antibodies are monoclonal.



24. An immunoassay for detecting the presence
of an HCV polypeptide in a test sample comprising:

(a) incubating the test sample under
conditions that allow the formation of antigen-
antibody complexes with anti-HCV antibodies
that bind an HCV epitope wherein:

(i) the HCV epitope comprises an amino
acid sequence from an HCV isolate selected
from the group consisting of J1 and J7;



WO 91/04262 PCT/US90/05242 

-75- 

(ii) the J1 or J7 amino acid sequence is
distinct from the amino acid sequence of
HCV isolate HCV1; and

(iii) the J1 or J7 amino acid sequence is
not immunologically cross-reactive with
HCV1; and

(b) detecting an antigen-antibody complex
comprising the anti-HCV antibodies.

25. An immunoassay according to claim 24
wherein the J1 amino acid sequence is from a viral domain
selected from the group consisting of amino acid sequence
from amino acid 116 to amino acid 350, from amino acid
351 to amino acid 651, from amino acid 1007 to amino acid
1650, and from amino acid 2100 to the end of the coding
sequence.

26. An immunoassay according to claim 24
wherein the J7 amino acid sequence is from amino acid 1
to amino acid 115.

27. A method of producing anti-HCV antibodies
comprising administering to a mammal a polypeptide
comprising an amino acid sequence from an HCV isolate
selected from the group consisting of J1 and J7 wherein
the J1 or J7 amino acid sequence is distinct from the
amino acid sequence of HCV isolate HCV1 whereby the
mammal produces anti-HCV antibodies.



28. A method of detecting HCV polynucleotides 
in a test sample comprising: 

(a) providing a probe comprising the DNA 
molecule of claim 1; 

(b) contacting the test sample and the probe
under conditions that allow for the formation
of a polynucleotide duplex between the probe
and its complement in the absence of
substantial polynucleotide duplex formation
between the probe and non-HCV polynucleotide
sequences present in the test sample; and

(c) detecting any polynucleotide duplexes
comprising the probe.



29. A method of producing a recombinant
polypeptide comprising an HCV amino acid sequence, the
method comprising:

(a) providing host cells transformed by a DNA
construct comprising a control sequence for the
host cell operably linked to a coding
sequence encoding an amino acid sequence from
an HCV isolate selected from the group
comprised of J1 and J7 wherein the J1 or J7
amino acid sequence is distinct from the amino acid sequence
of HCV isolate HCV1;

(b) growing the host cells under conditions
whereby the coding sequence is transcribed and
translated into the recombinant polypeptide;
and

(c) recovering the recombinant polypeptide.

30. A biological material derived from the
group consisting of materials deposited under Accession
Numbers BP-2593, BP-2594, BP-2595, BP-2637, BP-2638,
ATCC No. 68395, and ATCC No. 408884.

1/79

J7 1 AGCCGAGTAGTGTTGGGTCGCGAAAGGCCTTGTGGT

discrepancy
clone



J7 37 ACTGCCTGATAGGGTGCTTGCGAGTGCCCCGGGAGG 

Met Ser Thr Asn 

J7 73 TCTCGTAGACCGTGCATC ATG AGC ACA AAT 

Pro Lys Pro Gln Arg Lys Thr Lys Arg

J7 103 CCT AAA CCC CAA AGA AAA ACC AAA CGT 

T G 
b 1 

Arg 



Asn Thr Asn Arg Arg Pro Gln Asp Val

J7 130 AAC ACC AAC CGT CGC CCA CAG GAC GTT 

b 



Lys Phe Pro Gly 

J7 157 AAG TTC CCG GGC 

T 
1 

Leu 



Gly Gly Gln Ile Val

GGT GGT CAG ATC GTC 

T 
b 



Gly Gly Val Tyr Leu Leu Pro Arg Arg

J7 184 GGT GGA GTT TAC TTG TTG CCG CGC AGG

A 
b 



Gly Pro Arg Leu Gly Val Arg Ala Thr 

J7 211 GGC CCC AGG TTG GGT GTG CGT GCG ACT 

FIG. 1-1

2/79

Arg Lys Thr Ser Glu Arg Ser Gln Pro

AGG AAG ACT TCC GAG CGG TCG CAA CCT 

A 
b 



Arg Gly Arg Arg Gln Pro Ile Pro Lys

CGT GGA AGG CGA CAA CCT ATC CCC AAG 

Ala Arg Arg Pro Glu Gly Arg Thr Trp 

GCT CGC CGG CCC GAG GGC AGG ACC TGG 



Ala Gln Pro Gly Tyr Pro Trp Pro Leu

GCT CAG CCT GGG TAT CCT TGG CCC CTC 



Tyr Gly Asn Glu Gly Leu Gly Trp Ala 

TAT GGC AAT GAG GGC TTG GGG TGG GCA 

A 
b 
END 



Gly Trp Leu Leu Ser Pro Arg Gly Ser 

GGA TGG CTC CTG TCA CCC CGC GGC TCT 



Arg Pro Ser Trp Gly Pro Asn Asp Pro 

CGG CCT AGT TGG GGC CCC AAT GAC CCC 

T C 
C b 

Thr 



Arg Arg Arg Ser Arg Asn Leu Gly Lys 

CGG CGT AGG TCG CGT AAT TTG GGT AAG 



Val Ile Asp Thr Leu Thr Cys Gly Phe

GTC ATC GAT ACC CTT ACA TGC GGC TTC

FIG. 1-2 i 

Leu

Ala Asp Leu Met Gly Tyr Ile Pro Leu

GCC GAC CTC ATG GGG TAC ATT CCG CTT 

C C 
C b 



Val Gly Ala Pro Leu Gly Gly Ala Ala 

GTC GGC GCC CCC TTA GGG GGC GCT GCC 



Arg Ala Leu Ala His Gly 

AGG GCC CTG GCA CAT GGT 

FIG. 1-3

4/79

Pro Leu Val Gly Ala Pro Leu Gly Gly

J1 1 T CCG CTC GTC GGC GCC CCC TTA GGG GGC

discrepancy C 
clone 



Ala Ala Arg Ala Leu Ala His Gly Val 
29 GCT GCC AGG GCC CTG GCA CAT GGT GTC 

Arg Val Leu Glu Asp Gly Val Asn Tyr 

56 CGG GTT CTG GAG GAC GGC GTG AAC TAT 

Ala Thr Gly Asn Leu Pro Gly Cys Ser 

83 GCA ACA GGG AAT TTG CCC GGT TGC TCT 



Phe Ser Ile Phe Leu Leu Ala Leu Leu

110 TTC TCT ATC TTC CTC TTG GCT CTG CTG 

A T 
g d 



Ser Cys Leu Thr Ile Pro Ala Ser Ala

137 TCC TGT TTG ACC ATC CCA GCT TCC GCT 



Tyr Glu Val Arg Asn Val Ser Gly Ile

164 TAT GAA GTG CGC AAC GTG TCC GGG ATA 



Tyr His Val Thr Asn Asp Cys Ser Asn 

191 TAC CAT GTC ACA AAC GAC TGC TCC AAC 

T 
d 



Ser Ser Ile Val Tyr Glu Ala Ala Asp

218 TCA AGC ATT GTG TAT GAG GCG GCG GAC 

FIG. 2-1 

Val Ile Met His Ala Pro Gly Cys Val

GTG ATC ATG CAT GCC CCC GGG TGC GTG 



Pro Cys Val Arg Glu Asn Asn Ser Ser 

CCC TGC GTT CGG GAG AAC AAT TCC TCC 

C 
d 



Arg Cys Trp Val Ala Leu Thr Pro Thr 

CGT TGC TGG GTA GCG CTC ACT CCC ACG 



Leu Ala Ala Arg Asn Ala Ser Val Pro 

CTC GCG GCC AGG AAT GCC AGC GTC CCC 



Thr Thr Thr Leu Arg Arg His Val Asp 

ACT ACG ACA TTA CGA CGC CAC GTC GAC 

G 
d 



Leu Leu Val Gly Thr Ala Ala Phe Cys 

TTG CTC GTT GGG ACG GCT GCT TTC TGC 



Ser Ala Met Tyr Val Gly Asp Leu Cys 
TCC GCT ATG TAC GTG GGG GAT CTC TGC 



Gly Ser Val Phe Leu Ile Ser Gln Leu

GGA TCT GTT TTC CTC ATC TCC CAG CTG 

T 
d 



Phe Thr Phe Ser Pro Arg Arg His Glu 

TTC ACC TTC TCG CCT CGC CGG CAT GAG 

FIG. 2-2

Thr Val Gln Asp Cys Asn Cys Ser Ile

ACA GTA CAG GAC TGC AAC TGC TCA ATC



Tyr Pro Gly His Val Ser Gly His Arg 

TAT CCC GGC CAC GTA TCA GGC CAT CGC 

T 
C 



Met Ala Trp Asp Met Met Met Asn Trp 

ATG GCT TGG GAT ATG ATG ATG AAC TGG 



Ser Pro Thr Ala 

TCG CCC ACG GCA 

FIG. 2-3

Asn Trp Ser Pro Thr

J1 1 AAC TGG TCG CCC ACG

discrepancy 
clone 

altered aa 

Ala Ala Leu Val Val Ser Gln Leu Leu

J1 16 GCA GCC TTA GTG GTG TCG CAG TTA CTC



i i i 

• • • 



Arg Ile Pro Gln Ala Val Met Asp Met

J1 43 CGG ATC CCA CAA GCT GTC ATG GAC ATG



Val Ala Gly Ala His Trp Gly Val Leu

J1 70 GTG GCG GGG GCC CAC TGG GGA GTC CTA

G 

* 

1 



Ala Gly Leu Ala Tyr Tyr Ser Met Val 

J1 97 GCG GGC CTT GCC TAC TAT TCC ATG GTG

A 

■ 

1 



Gly Asn Trp Ala Lys Val Leu Ile Val

J1 124 GGG AAC TGG GCT AAG GTT TTG ATT GTG



Met Leu Leu Phe Ala Gly Val Asp Gly
J1 151 ATG CTA CTC TTT GCC GGC GTT GAC GGG



His Thr Arg Val Thr Gly Gly Val Gln

J1 178 CAT ACC CGC GTG ACG GGG GGG GTG CAA

AG A 
gg i 
Ser 

FIG. 3-1

Gly His Val Thr Ser Thr Leu Thr Ser

J1 205 GGC CAC GTC ACC TCT ACA CTC ACG TCC

T G

C i
Ala

Leu Phe Arg Pro Gly Ala Ser Gln Lys
J1 232 CTC TTT AGA CCT GGG GCG TCC CAG AAA

Ile Gln Leu Val Asn Thr Asn Gly Ser

J1 259 ATT CAG CTT GTA AAC ACC AAT GGC AGT

TC T
11 1

Ser Leu

Trp His Ile Asn Arg Thr Ala Leu Asn

J1 286 TGG CAT ATC AAC AGG ACT GCC CTG AAC

T

g

Cys Asn Asp Ser Leu Gln Thr Gly Phe

J1 313 TGC AAT GAC TCC CTC CAA ACT GGG TTC

Leu Ala Ala Leu

J1 340 CTT GCC GCG CTG

FIG. 3-2

J1 1
discrepancy
clone

altered aa

Val Ile Asp Cys Asn Thr Cys Val Thr
GTG ATC GAC TGT AAC ACA TGT GTC ACT

Gln Thr Val Asp Phe Ser Leu Asp Pro
CAG ACG GTC GAT TTC AGC TTG GAT CCC

Thr Phe Thr Ile Glu Thr Thr Thr Val
ACC TTC ACC ATC GAG ACG ACG ACC GTG
G
c
Ala

Pro Gln Asp Ala Val Ser Arg Thr Gln
CCC CAA GAT GCG GTT TCG CGC ACG CAG

Arg Arg Gly Arg Thr Gly Arg Gly Arg
CGG CGA GGT AGG ACT GGC AGG GGC AGG

Arg Gly Ile Tyr Arg Phe Val Thr Pro
AGA GGC ATC TAT AGG TTT GTG ACT CCA

Gly Glu Arg Pro Ser Ala Met Phe Asp
GGA GAA CGG CCC TCG GCG ATG TTC GAT

Ser Ser Val Leu Cys Glu Cys Tyr Asp
TCT TCG GTC CTA TGT GAG TGT TAT GAC

Ala Gly Cys Ala Trp Tyr Glu Leu Thr
GCG GGC TGT GCT TGG TAT GAG CTC ACG
A
e
Gly(=)

FIG. 4-1

Pro Ala Glu Thr Ser Val Arg Leu Arg
J1 248 CCC GCT GAG ACC TCG GTT AGG TTG CGG



Ala Tyr Leu Asn Thr Pro Gly Leu Pro 

J1 275 GCT TAC CTA AAT ACA CCA GGG TTG CCC



Val Cys Gln Asp His Leu Glu Phe Trp
J1 302 GTC TGC CAG GAC CAT CTG GAG TTC TGG



Glu Ser Val Phe Thr Gly Leu Thr His

J1 329 GAG AGC GTC TTC ACA GGC CTC ACC CAC



Ile Asp Ala His Phe Leu Ser Gln Thr

J1 356 ATA GAC GCC CAC TTC TTG TCC CAG ACT



Lys Gln Ala Gly Asp Asn Phe Pro Tyr

J1 383 AAG CAG GCA GGA GAC AAC TTC CCC TAC



Leu Val Ala Tyr Gln Ala Thr Val Cys

J1 410 CTG GTA GCA TAC CAA GCC ACA GTG TGC



Ala Arg Ala Lys Ala Pro Pro Pro Ser

J1 437 GCC AGG GCT AAG GCT CCA CCT CCA TCG

C 

e

Ala(=) 



Trp Asp Gln Met Trp Lys Cys Leu Ile
J1 464 TGG GAT CAA ATG TGG AAG TGT CTC ATA



Arg Leu Lys Pro Thr Leu His Gly Pro

J1 491 CGG CTA AAG CCT ACG CTG CAC GGG CCA

G 

e 
Ala 






FIG. 4-2

Thr Pro Leu Leu Tyr Arg Leu Gly Ala

J1 518 ACG CCC CTG CTG TAT AGG CTA GGA GCC

A 

Arg(=) 

Val Gln Asn Glu Val Thr Leu Thr His

J1 545 GTC CAG AAT GAG GTC ACC CTC ACA CAC



Pro Ile Thr Lys
J1 572 CCT ATA ACC AAA

FIG. 4-3

Leu Thr

J1 1 C CTC ACC

discrepancy

clone

altered aa



Arg Asp Pro Thr Val Pro Leu Ala Arg

J1 8 CGT GAC CCC ACC GTC CCC CTT GCG CGG



Ala Ala Trp Glu Thr Ala Arg His Thr 

35 GCT GCG TGG GAG ACA GCT AGA CAC ACT 

C 

g 

Thr(=) 



Pro Val Asn Ser Trp Leu Gly Asn Ile
J1 62 CCA GTC AAC TCC TGG CTA GGC AAC ATC



Ile Met Tyr Ala Pro Thr Leu Trp Ala

J1 89 ATC ATG TAT GCG CCC ACT TTG TGG GCA

g 

Ile(=) 



Arg Met Ile Leu Met Thr His Phe Phe
J1 116 AGG ATG ATT CTG ATG ACT CAC TTC TTC



Ser Ile Leu Leu Ala Gln Glu Gln Leu

J1 143 TCC ATC CTT CTA GCC CAG GAG CAA CTT



Glu Lys Ala Leu Asp Cys Gln Ile Tyr
J1 170 GAA AAA GCC CTG GAT TGT CAA ATC TAC

Gly Ala Cys Tyr Ser Ile Glu Pro Leu

J1 197 GGG GCC TGT TAC TCC ATT GAG CCA CTT

FIG. 5-1

Asp Leu Pro Gln Ile Ile Glu Arg Leu

J1 224 GAC CTA CCT CAG ATC ATT GAA CGA CTC



His Gly Leu Ser Ala Phe Ser Leu His 
J1 251 CAT GGT CTT AGC GCA TTT TCA CTC CAT



Ser Tyr Ser Pro Gly Glu Ile Asn Arg
J1 278 AGT TAC TCT CCA GGT GAG ATC AAT AGG



Val Ala Ser Cys Leu Arg Lys Leu Gly

J1 305 GTG GCT TCA TGC CTC AGG AAG CTT GGG



Val Pro Pro Leu Arg Val Trp Arg His

J1 332 GTA CCA CCC TTG CGA GTC TGG AGA CAT



Arg Ala Arg Ser Val Arg Ala Lys Leu

J1 359 CGG GCC AGA AGT GTC CGC GCT AAG CTA



Leu Ser Gln Gly Gly Arg Ala Ala Thr

J1 386 CTG TCC CAA GGG GGG AGG GCC GCC ACT

G 

g 

Gln(=) 



Lys Gly Lys Tyr Leu 
J1 413 TGT GGC AAG TAC CTC

FIG. 5-2

J7 1 AGCCGAGTAGTGTTGGGTCGCGAAAGGCCTTGTGGT
HCV1

J7 37 ACTGCCTGATAGGGTGCTTGCGAGTGCCCCGGGAGG
HCV1

Met Ser Thr Asn
J7 73 TCTCGTAGACCGTGCATC ATG AGC ACA AAT



HCVl 


















CI 




J7 
HCVl 


103 


Pro 
CCT 


Lys 
AAA 


Pro 
CCC 

m 

J. 


Gln
CAA 


Arg 
AGA 

7A 

Lys 


Lys 
AAA 


Thr 

ACC 

A 

Asn 
*** 


Lys 


Arg 
CGT 


J7 
HCVl 


130 


Tfc am a* 

Asn 
AAC 


inr 
ACC 


Asn 
AAC 


CGT 


CGC 


Pro 
CCA 


Gln
CAG 


GAC 


V U -I- 

GTT 
C 


J7 
HCVl 


157 


Lys 

AAG 


Phe 
TTC 


Pro 

CCG 


Gly 

GGC 
T 


Gly 

GGT 
C 


Gly Gin 

GGT CAG 


Ile
ATC 


Val 
GTC 
T 


J7 
HCVl 


184 


Gly 

GGT 


Gly 

GGA 


Val 

GTT 


Tyr 

TAC 


Leu 

TTG 


Leu 
TTG 


Pro 
CCG 


Arg 
CGC 


Arg 

AGG 


J7 
HCVl 


211 


Gly 

GGC 


Pro 

CCC

T 


Arg 

AGG 
A 


Leu 

TTG 


Gly 
GGT 


Val 
GTG 


Arg 

CGT 
C 


Ala 
GCG 


Thr 

ACT 
G 


J7 


238 


Arg 

AGG 


Lys 

AAG 


Thr 

ACT 


Ser 
TCC 


Glu
GAG 


Arg 

CGG 


Ser 

TCG 


Gln
CAA 


Pro 

CCT 



HCVl A 

FIG. 6-1

Arg Gly Arg Arg Gln Pro Ile Pro Lys

CGT GGA AGG CGA CAA CCT ATC CCC AAG 
A T A T G 



Ala Arg Arg Pro Glu Gly Arg Thr Trp 

GCT CGC CGG CCC GAG GGC AGG ACC TGG 



Ala Gln Pro Gly Tyr Pro Trp Pro Leu

GCT CAG CCT GGG TAT CCT TGG CCC CTC 

C C 



Tyr Gly Asn Glu Gly Leu Gly Trp Ala 

TAT GGC AAT GAG GGC TTG GGG TGG GCA 

GC G 

Cys 
*** 



Gly Trp Leu Leu Ser Pro Arg Gly Ser 

GGA TGG CTC CTG TCA CCC CGC GGC TCT 

T T 



Arg Pro Ser Trp Gly Pro Asn Asp Pro 

CGG CCT AGT TGG GGC CCC AAT GAC CCC 

C CA 

Thr 
*** 



Arg Arg Arg Ser Arg Asn Leu Gly Lys 

CGG CGT AGG TCG CGT AAT TTG GGT AAG 

C 



Val Ile Asp Thr Leu Thr Cys Gly Phe

GTC ATC GAT ACC CTT ACA TGC GGC TTC 

G 



FIG. 6-2

Ala Asp Leu Met Gly Tyr Ile Pro Leu

J7 481 GCC GAC CTC ATG GGG TAC ATT CCG CTT

A C 



HCV1 



Val Gly Ala Pro Leu Gly Gly Ala Ala 

J7 508 GTC GGC GCC CCC TTA GGG GGC GCT GCC 
HCV1 T C T A 



Arg Ala Leu Ala His Gly 
J7 535 AGG GCC CTG GCA CAT GGT 
HCV1 G C

FIG. 6-3

Pro Leu Val Gly Ala Pro Leu Gly Gly

J1 1 T CCG CTC GTC GGC GCC CCC TTA GGG GGC

A T C T A 



Ala Ala Arg Ala Leu Ala His Gly Val 

J1 29 GCT GCC AGG GCC CTG GCA CAT GGT GTC

G C



Arg Val Leu Glu Asp Gly Val Asn Tyr

J1 56 CGG GTT CTG GAG GAC GGC GTG AAC TAT

A 



Ala Thr Gly Asn Leu Pro Gly Cys Ser 

J1 83 GCA ACA GGG AAT TTG CCC GGT TGC TCT

C C T T



Phe Ser Ile Phe Leu Leu Ala Leu Leu

J1 110 TTC TCT ATC TTC CTC TTG GCT CTG CTG

T C C C



Ser Cys Leu Thr Ile Pro Ala Ser Ala

J1 137 TCC TGT TTG ACC ATC CCA GCT TCC GCT

TC TGGC GC

Val



Tyr Glu Val Arg Asn Val Ser Gly Ile

J1 164 TAT GAA GTG CGC AAC GTG TCC GGG ATA

C C TCC AG C T

Gln Ser Thr Leu

*** ***



Tyr His Val Thr Asn Asp Cys Ser Asn

J1 191 TAC CAT GTC ACA AAC GAC TGC TCC AAC

C C T T C T 

FIG. 7-1

Ser Ser Ile Val Tyr Glu Ala Ala Asp
218 TCA AGC ATT GTG TAT GAG GCG GCG GAC
G T C C T 



Val Ile Met His Ala Pro Gly Cys Val

245 GTG ATC ATG CAT GCC CCC GGG TGC GTG
CC C C A T G C

Ala Leu Thr 



Pro Cys Val Arg Glu Asn Asn Ser Ser 

272 CCC TGC GTT CGG GAG AAC AAT TCC TCC 
T T GG C G G 

Gly Ala 
*** 



Arg Cys Trp Val Ala Leu Thr Pro Thr

J1 299 CGT TGC TGG GTA GCG CTC ACT CCC ACG

AG T G AG C T

Met



Leu Ala Ala Arg Asn Ala Ser Val Pro

J1 326 CTC GCG GCC AGG AAT GCC AGC GTC CCC

GGCA G GAAC 

Val Thr Asp Gly Lys Leu 

*** *** 



Thr Thr Thr Leu Arg Arg His Val Asp

J1 353 ACT ACG ACA TTA CGA CGC CAC GTC GAC

G G CAG C T T A T

Ala Gln Ile

***



Leu Leu Val Gly Thr Ala Ala Phe Cys
J1 380 TTG CTC GTT GGG ACG GCT GCT TTC TGC
C TC GCCACC T 

Ser Thr Leu 



FIG. 7-2

Ser Ala Met Tyr Val Gly Asp Leu Cys

J1 407 TCC GCT ATG TAC GTG GGG GAT CTC TGC

G C C C C A 

Leu 



Gly Ser Val Phe Leu Ile Ser Gln Leu

J1 434 GGA TCT GTT TTC CTC ATC TCC CAG CTG

G C T T G GG A 

Val Gly 



Phe Thr Phe Ser Pro Arg Arg His Glu 
J1 461 TTC ACC TTC TCG CCT CGC CGG CAT GAG

T CAG C C TG 

Trp 
*** 



Thr Val Gln Asp Cys Asn Cys Ser Ile

J1 488 ACA GTA CAG GAC TGC AAC TGC TCA ATC

G ACG A GT T T 

Thr Gly 
*** *** 



Tyr Pro Gly His Val Ser Gly His Arg 

J1 515 TAT CCC GGC CAC GTA TCA GGC CAT CGC

T A AG T C
Ile Thr



Met Ala Trp Asp Met Met Met Asn Trp 

J1 542 ATG GCT TGG GAT ATG ATG ATG AAC TGG

A



Ser Pro Thr Ala
J1 569 TCG CCC ACG GCA

C T AG 

Thr 



FIG. 7-3

Asn Trp Ser Pro Thr Ala

J1 1 AAC TGG TCG CCC ACG GCA
HCV1 C T AG

Thr 



Ala Leu Val Val Ser Gln Leu Leu Arg

J1 19 GCC TTA GTG GTG TCG CAG TTA CTC CGG
HCV1 G G A A G T C G

Met Ala



Ile Pro Gln Ala Val Met Asp Met Val

J1 46 ATC CCA CAA GCT GTC ATG GAC ATG GTG
HCV1 CAT AC

Ile Leu Ile



Ala Gly Ala His Trp Gly Val Leu Ala
J1 73 GCG GGG GCC CAC TGG GGA GTC CTA GCG
HCV1 T T T G



Gly Leu Ala Tyr Tyr Ser Met Val Gly

J1 100 GGC CTT GCC TAC TAT TCC ATG GTG GGG
HCV1 A A G T TC

Ile Phe



Asn Trp Ala Lys Val Leu Ile Val Met

J1 127 AAC TGG GCT AAG GTT TTG ATT GTG ATG
HCV1 G C C G A C

Val Leu



Leu Leu Phe Ala Gly Val Asp Gly His

J1 154 CTA CTC TTT GCC GGC GTT GAC GGG CAT
HCV1 G A C C G A 

Ala Glu 
*** 





Thr Arg Val Thr Gly Gly Val Gln Gly

J1 181 ACC CGC GTG ACG GGG GGG GTG CAA GGC
HCV1 A C C A AGT GCC

His Ser Ala
*** ***

His Val Thr Ser Thr Leu Thr Ser Leu
J1 208 CAC GTC ACC TCT ACA CTC ACG TCC CTC
HCV1 ACT GTG GG T T GTT AG 

Thr Val Gly Phe Val 
*** *** *** 



Phe Arg Pro Gly Ala Ser Gln Lys Ile

J1 235 TTT AGA CCT GGG GCG TCC CAG AAA ATT

HCV1 C C GC A C C AAG C G C 

Leu Ala Lys Asn Val 

*** *** *** 



Gln Leu Val Asn Thr Asn Gly Ser Trp

J1 262 CAG CTT GTA AAC ACC AAT GGC AGT TGG

HCV1 G A C C

Ile



His Ile Asn Arg Thr Ala Leu Asn Cys
J1 289 CAT ATC AAC AGG ACT GCC CTG AAC TGC
HCV1 C C T C G 

Leu Ser 

*** 



Asn Asp Ser Leu Gln Thr Gly Phe Leu

J1 316 AAT GAC TCC CTC CAA ACT GGG TTC CTT

HCV1 T AG AC C C GG T G 

Asn Trp 





Ala Ala Leu

J1 343 GCC GCG CTG
HCV1 A G T

Gly

Ser Val Ile

J1 1 C TCA GTG ATC
HCV1 ggctataccggcgacttcga G A



Asp Cys Asn Thr Cys Val Thr Gln Thr
J1 11 GAC TGT AAC ACA TGT GTC ACT CAG ACG
HCV1 C T G C A

Val Asp Phe Ser Leu Asp Pro Thr Phe
J1 38 GTC GAT TTC AGC TTG GAT CCC ACC TTC
HCV1 C T C T

Thr Ile Glu Thr Thr Thr Val Pro Gln
J1 65 ACC ATC GAG ACG ACG ACC GTG CCC CAA
HCV1 T A TC G C C G

Asp Ala Val Ser Arg Thr Gln Arg Arg
J1 92 GAT GCG GTT TCG CGC ACG CAG CGG CGA
HCV1 T C C T A T G

Gly Arg Thr Gly Arg Gly Arg Arg Gly
J1 119 GGT AGG ACT GGC AGG GGC AGG AGA GGC
HCV1 C G A CC

Lys Pro

Ile Tyr Arg Phe Val Thr Pro Gly Glu
J1 146 ATC TAT AGG TTT GTG ACT CCA GGA GAA
HCV1 C A G A G G G

Ala

Arg Pro Ser Ala Met Phe Asp Ser Ser
J1 173 CGG CCC TCG GCG ATG TTC GAT TCT TCG
HCV1 C C GC CGC

Gly



FIG. 9-1

Val Leu Cys Glu Cys Tyr Asp Ala Gly

J1 200 GTC CTA TGT GAG TGT TAT GAC GCG GGC

HCV1 CCA 



Cys Ala Trp Tyr Glu Leu Thr Pro Ala 

J1 227 TGT GCT TGG TAT GAG CTC ACG CCC GCT

HCV1 C



Glu Thr Ser Val Arg Leu Arg Ala Tyr

J1 254 GAG ACC TCG GTT AGG TTG CGG GCT TAC

HCV1 T A A C A A G

Thr



Leu Asn Thr Pro Gly Leu Pro Val Cys

J1 281 CTA AAT ACA CCA GGG TTG CCC GTC TGC

HCV1 AGCCG CT G 

Met 



Gln Asp His Leu Glu Phe Trp Glu Ser

J1 308 CAG GAC CAT CTG GAG TTC TGG GAG AGC

HCV1 TAT G

Gly



Val Phe Thr Gly Leu Thr His Ile Asp
J1 335 GTC TTC ACA GGC CTC ACC CAC ATA GAC

HCV1 T T T T

Ala His Phe Leu Ser Gln Thr Lys Gln

J1 362 GCC CAC TTC TTG TCC CAG ACT AAG CAG

HCV1 T C A A



Ala Gly Asp Asn Phe Pro Tyr Leu Val

J1 389 GCA GGA GAC AAC TTC CCC TAC CTG GTA

HCV1 AGT G G C T T

Ser Glu Leu



FIG. 9-2

Ala Tyr Gln Ala Thr Val Cys Ala Arg

GCA TAC CAA GCC ACA GTG TGC GCC AGG
G C T

Ala Lys Ala Pro Pro Pro Ser Trp Asp

GCT AAG GCT CCA CCT CCA TCG TGG GAT
C A C T C C
Gln

Gln Met Trp Lys Cys Leu Ile Arg Leu

CAA ATG TGG AAG TGT CTC ATA CGG CTA
G T G T C C

Lys Pro Thr Leu His Gly Pro Thr Pro

AAG CCT ACG CTG CAC GGG CCA ACG CCC
C C C T A

Leu Leu Tyr Arg Leu Gly Ala Val Gln

CTG CTG TAT AGG CTA GGA GCC GTC CAG
A C A G C T T

Asn Glu Val Thr Leu Thr His Pro Ile

AAT GAG GTC ACC CTC ACA CAC CCT ATA
A A G G A G C
Ile Val

Thr Lys

ACC AAA

tacatcatgacatgcatgtc

FIG. 9-3

Leu Thr
J1 1 C CTC ACC
HCV1

Arg Asp Pro Thr Val Pro Leu Ala Arg
J1 8 CGT GAC CCC ACC GTC CCC CTT GCG CGG
HCV1 T A AC C A A

Thr
***

Ala Ala Trp Glu Thr Ala Arg His Thr
J1 35 GCT GCG TGG GAG ACA GCT AGA CAC ACT
HCV1 A

Pro Val Asn Ser Trp Leu Gly Asn Ile
J1 62 CCA GTC AAC TCC TGG CTA GGC AAC ATC
HCV1


Ile Met Tyr Ala Pro Thr Leu Trp Ala
J1 89 ATC ATG TAT GCG CCC ACT TTG TGG GCA
HCV1 T C AC G

Phe

Arg Met Ile Leu Met Thr His Phe Phe
J1 116 AGG ATG ATT CTG ATG ACT CAC TTC TTC
HCV1 A CTT

Ser Ile Leu Leu Ala Gln Glu Gln Leu
J1 143 TCC ATC CTT CTA GCC CAG GAG CAA CTT
HCV1 AG G A AG C G

Val Ile Arg Asp

* **

Glu Lys Ala Leu Asp Cys Gln Ile Tyr
J1 170 GAA AAA GCC CTG GAT TGT CAA ATC TAC
HCV1 C G C CGG

Gln Glu

*** ***

Gly Ala Cys Tyr Ser Ile Glu Pro Leu
GGG GCC TGT TAC TCC ATT GAG CCA CTT

C A A 



Asp Leu Pro Gln Ile Ile Glu Arg Leu
GAC CTA CCT CAG ATC ATT GAA CGA CTC
T CA C A

Pro Gln
*** *** 



His Gly Leu Ser Ala Phe Ser Leu His 

CAT GGT CTT AGC GCA TTT TCA CTC CAT 
C C C 



Ser Tyr Ser Pro Gly Glu Ile Asn Arg

AGT TAC TCT CCA GGT GAG ATC AAT AGG 

A T 



Val Ala Ser Cys Leu Arg Lys Leu Gly 

GTG GCT TCA TGC CTC AGG AAG CTT GGG 
C G A A 

Ala 



Val Pro Pro Leu Arg Val Trp Arg His 
GTA CCA CCC TTG CGA GTC TGG AGA CAT 
G CT C 

Ala 
*** 



Arg Ala Arg Ser Val Arg Ala Lys Leu

CGG GCC AGA AGT GTC CGC GCT AAG CTA
C G C G T

Arg



Leu Ser Gln Gly Gly Arg Ala Ala Thr

CTG TCC CAA GGG GGG AGG GCC GCC ACT

G AG A C T TA

Ala Arg Ile
*** ***

FIG. 10-2

HCV-1



-267 GCGTCTAGCCATGGCGTTAGTATGAGTGTCGTGCAGCCTCCAGG
CGCAGATCGGTACCGCAATCATACTCACAGCACGTCGGAGGTCC 



-223 ACCCCCCCTCCCGGGAGAGCCATAGTGGTCTGCGGAACCGGTGA 
TGGGGGGGAGGGCCCTCTCGGTATCACCAGACGCCTTGGCCACT 



-179 GTACACCGGAATTGCCAGGACGACCGGGTCCTTTCTTGGATCAA
CATGTGGCCTTAACGGTCCTGCTGGCCCAGGAAAGAACCTAGTT 



-135 CCCGCTCAATGCCTGGAGATTTGGGCGTGCCCCCGCAAGACTGC 
GGGCGAGTTACGGACCTCTAAACCCGCACGGGGGCGTTCTGACG 



-91 TAGCCGAGTAGTGTTGGGTCGCGAAAGGCCTTGTGGTACTGCCT
ATCGGCTCATCACAACCCAGCGCTTTCCGGAACACCATGACGGA 



-47 GATAGGGTGCTTGCGAGTGCCCCGGGAGGTCTCGTAGACCGTGC 
CTATCCCACGAACGCTCACGGGGCCCTCCAGAGCATCTGGCACG 



-3 ACC -1 
TGG 



Met Ser Thr Asn Pro Lys Pro Gln Lys Lys Asn
1 ATG AGC ACG AAT CCT AAA CCT CAA AAA AAA AAC
TAC TCG TGC TTA GGA TTT GGA GTT TTT TTT TTG



Lys Arg Asn Thr Asn Arg Arg Pro Gln Asp Val

34 AAA CGT AAC ACC AAC CGT CGC CCA CAG GAC GTC
TTT GCA TTG TGG TTG GCA GCG GGT GTC CTG CAG



Lys Phe Pro Gly Gly Gly Gln Ile Val Gly Gly

67 AAG TTC CCG GGT GGC GGT CAG ATC GTT GGT GGA 
TTC AAG GGC CCA CCG CCA GTC TAG CAA CCA CCT 

FIG. 12-1

29/79

Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu

100 GTT TAC TTG TTG CCG CGC AGG GGC CCT AGA TTG
CAA ATG AAC AAC GGC GCG TCC CCG GGA TCT AAC



Gly Val Arg Ala Thr Arg Lys Thr Ser Glu Arg 

133 GGT GTG CGC GCG ACG AGA AAG ACT TCC GAG CGG 
CCA CAC GCG CGC TGC TCT TTC TGA AGG CTC GCC 



Ser Gln Pro Arg Gly Arg Arg Gln Pro Ile Pro

166 TCG CAA CCT CGA GGT AGA CGT CAG CCT ATC CCC 
AGC GTT GGA GCT CCA TCT GCA GTC GGA TAG GGG 



Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala

199 AAG GCT CGT CGG CCC GAG GGC AGG ACC TGG GCT
TTC CGA GCA GCC GGG CTC CCG TCC TGG ACC CGA



Gln Pro Gly Tyr Pro Trp Pro Leu Tyr Gly Asn

232 CAG CCC GGG TAC CCT TGG CCC CTC TAT GGC AAT 
GTC GGG CCC ATG GGA ACC GGG GAG ATA CCG TTA 



Glu Gly Cys Gly Trp Ala Gly Trp Leu Leu Ser 

265 GAG GGC TGC GGG TGG GCG GGA TGG CTC CTG TCT 
CTC CCG ACG CCC ACC CGC CCT ACC GAG GAC AGA 



Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr 

298 CCC CGT GGC TCT CGG CCT AGC TGG GGC CCC ACA 
GGG GCA CCG AGA GCC GGA TCG ACC CCG GGG TGT 



Asp Pro Arg Arg Arg Ser Arg Asn Leu Gly Lys 

331 GAC CCC CGG CGT AGG TCG CGC AAT TTG GGT AAG 
CTG GGG GCC GCA TCC AGC GCG TTA AAC CCA TTC 



Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp

364 GTC ATC GAT ACC CTT ACG TGC GGC TTC GCC GAC 
CAG TAG CTA TGG GAA TGC ACG CCG AAG CGG CTG 

FIG. 12-2

Leu Met Gly Tyr Ile Pro Leu Val Gly Ala Pro
397 CTC ATG GGG TAC ATA CCG CTC GTC GGC GCC CCT
GAG TAC CCC ATG TAT GGC GAG CAG CCG CGG GGA


Leu Gly Gly Ala Ala Arg Ala Leu Ala His Gly 

430 CTT GGA GGC GCT GCC AGG GCC CTG GCG CAT GGC 
GAA CCT CCG CGA CGG TCC CGG GAC CGC GTA CCG 



Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala

463 GTC CGG GTT CTG GAA GAC GGC GTG AAC TAT GCA
CAG GCC CAA GAC CTT CTG CCG CAC TTG ATA CGT



Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile
496 ACA GGG AAC CTT CCT GGT TGC TCT TTC TCT ATC
TGT CCC TTG GAA GGA CCA ACG AGA AAG AGA TAG



Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val
529 TTC CTT CTG GCC CTG CTC TCT TGC TTG ACT GTG
AAG GAA GAC CGG GAC GAG AGA ACG AAC TGA CAC



Pro Ala Ser Ala Tyr Gln Val Arg Asn Ser Thr
562 CCC GCT TCG GCC TAC CAA GTG CGC AAC TCC ACG 
GGG CGA AGC CGG ATG GTT CAC GCG TTG AGG TGC 



Gly Leu Tyr His Val Thr Asn Asp Cys Pro Asn

595 GGG CTT TAC CAC GTC ACC AAT GAT TGC CCT AAC
CCC GAA ATG GTG CAG TGG TTA CTA ACG GGA TTG



Ser Ser Ile Val Tyr Glu Ala Ala Asp Ala Ile

628 TCG AGT ATT GTG TAC GAG GCG GCC GAT GCC ATC
AGC TCA TAA CAC ATG CTC CGC CGG CTA CGG TAG



Leu His Thr Pro Gly Cys Val Pro Cys Val Arg

661 CTG CAC ACT CCG GGG TGC GTC CCT TGC GTT CGT
GAC GTG TGA GGC CCC ACG CAG GGA ACG CAA GCA

FIG. 12-3

Glu Gly Asn Ala Ser Arg Cys Trp Val Ala Met

GAG GGC AAC GCC TCG AGG TGT TGG GTG GCG ATG 
CTC CCG TTG CGG AGC TCC ACA ACC CAC CGC TAC 



Thr Pro Thr Val Ala Thr Arg Asp Gly Lys Leu 
ACC CCT ACG GTG GCC ACC AGG GAT GGC AAA CTC 
TGG GGA TGC CAC CGG TGG TCC CTA CCG TTT GAG 



Pro Ala Thr Gln Leu Arg Arg His Ile Asp Leu
CCC GCG ACG CAG CTT CGA CGT CAC ATC GAT CTG 
GGG CGC TGC GTC GAA GCT GCA GTG TAG CTA GAC 



Leu Val Gly Ser Ala Thr Leu Cys Ser Ala Leu 

CTT GTC GGG AGC GCC ACC CTC TGT TCG GCC CTC 
GAA CAG CCC TCG CGG TGG GAG ACA AGC CGG GAG 



Tyr Val Gly Asp Leu Cys Gly Ser Val Phe Leu 

TAC GTG GGG GAC CTA TGC GGG TCT GTC TTT CTT 
ATG CAC CCC CTG GAT ACG CCC AGA CAG AAA GAA 



Val Gly Gln Leu Phe Thr Phe Ser Pro Arg Arg

GTC GGC CAA CTG TTC ACC TTC TCT CCC AGG CGC 
CAG CCG GTT GAC AAG TGG AAG AGA GGG TCC GCG 



His Trp Thr Thr Gln Gly Cys Asn Cys Ser Ile

CAC TGG ACG ACG CAA GGT TGC AAT TGC TCT ATC 
GTG ACC TGC TGC GTT CCA ACG TTA ACG AGA TAG 



Tyr Pro Gly His Ile Thr Gly His Arg Met Ala

TAT CCC GGC CAT ATA ACG GGT CAC CGC ATG GCA 
ATA GGG CCG GTA TAT TGC CCA GTG GCG TAC CGT 



Trp Asp Met Met Met Asn Trp Ser Pro Thr Thr 

TGG GAT ATG ATG ATG AAC TGG TCC CCT ACG ACG
ACC CTA TAC TAC TAC TTG ACC AGG GGA TGC TGC

Ala Leu Val Met Ala Gln Leu Leu Arg Ile Pro
GCG TTG GTA ATG GCT CAG CTG CTC CGG ATC CCA 
CGC AAC CAT TAC CGA GTC GAC GAG GCC TAG GGT 



Gln Ala Ile Leu Asp Met Ile Ala Gly Ala His

CAA GCC ATC TTG GAC ATG ATC GCT GGT GCT CAC 
GTT CGG TAG AAC CTG TAC TAG CGA CCA CGA GTG 



Trp Gly Val Leu Ala Gly Ile Ala Tyr Phe Ser
TGG GGA GTC CTG GCG GGC ATA GCG TAT TTC TCC 
ACC CCT CAG GAC CGC CCG TAT CGC ATA AAG AGG 



Met Val Gly Asn Trp Ala Lys Val Leu Val Val 
ATG GTG GGG AAC TGG GCG AAG GTC CTG GTA GTG 
TAC CAC CCC TTG ACC CGC TTC CAG GAC CAT CAC 



Leu Leu Leu Phe Ala Gly Val Asp Ala Glu Thr 

CTG CTG CTA TTT GCC GGC GTC GAC GCG GAA ACC 
GAC GAC GAT AAA CGG CCG CAG CTG CGC CTT TGG 



His Val Thr Gly Gly Ser Ala Gly His Thr Val 
CAC GTC ACC GGG GGA AGT GCC GGC CAG ACT GTG 
GTG CAG TGG CCC CCT TCA CGG CCG GTG TGA CAC 



Ser Gly Phe Val Ser Leu Leu Ala Pro Gly Ala 

TCT GGA TTT GTT AGC CTC CTC GCA CCA GGC GCC 
AGA CCT AAA CAA TCG GAG GAG CGT GGT CCG CGG 



Lys Gln Asn Val Gln Leu Ile Asn Thr Asn Gly

AAG CAG AAC GTC CAG CTG ATC AAC ACC AAC GGC 
TTC GTC TTG CAG GTC GAC TAG TTG TGG TTG CCG 



Ser Trp His Leu Asn Ser Thr Ala Leu Asn Cys 
AGT TGG CAC CTC AAT AGC ACG GCC CTG AAC TGC 
TCA ACC GTG GAG TTA TCG TGC CGG GAC TTG ACG 
Asn Asp Ser
1288 AAT GAT AGC 
TTA CTA TCG 
Leu Asn Thr Gly
CTC AAC ACC GGC 
GAG TTG TGG CCG 



Trp Leu Ala Gly 
TGG TTG GCA GGG 
ACC AAC CGT CCC 



Leu Phe Tyr His His Lys Phe Asn Ser Ser Gly 

1321 TTT TCT ATC ACC ACA AGT TCA ACT CTT CAG GCT 
GAA AAG ATA GTG GTG TTC AAG TTG AGA AGT CCG 



Cys Pro Glu 
1354 GTC CTG AGA 
ACA GGA CTC 



Thr Asp Phe 

1387 ACC GAT TTT 
TGG CTA AAA 



Tyr Ala Asn 

1420 TAT GCC AAC 
ATA CGG TTG 



Tyr Cys Trp 

1453 TAC TGC TGG
ATG ACG ACC 



Arg Leu Ala Ser 

GGC TAG CCA GCT 
TCC GAT CGG TCG 



Asp Gln Gly Trp

GAC CAG GGC TGG 
CTG GTC CCG ACC 



Gly Ser Gly Pro 

GGA AGC GGC CCC 
CCT TCG CCG GGG 



His Tyr Pro Pro 

CAC TAC CCC CCA 
GTG ATG GGG GGT 



Cys Arg Pro Leu 

GCC GAC CCC CTT 
ACG GCT GGG GAA 



Gly Pro Ile Ser

GGC CCT ATC AGT 
CCG GGA TAG TCA 



Asp Gln Arg Pro

GAC CAG CGC CCC 
CTG GTC GCG GGG 



Lys Pro Cys Gly 

AAA CCT TGC GGT 
TTT GGA ACG CCA 



Ile Val Pro Ala Lys Ser Val Cys Gly Pro Val

1486 ATT GTG CCC GCG AAG AGT GTG TGT GGT CCG GTA 
TAA CAC GGG CGC TTC TCA CAC ACA CCA GGC CAT 



Tyr Cys Phe 

1519 TAT TGC TTC 
ATA ACG AAG 



Thr Pro Ser Pro 
ACT CCC AGC CCC 
TGA GGG TCG GGG 



Val Val Val Gly 
GTG GTG GTG GGA 
CAC CAC CAC CCT 



Thr Thr Asp Arg Ser Gly Ala Pro Thr Tyr Ser 
1552 ACG ACC GAC AGG TCG GGC GCG CCC ACC TAC AGC 
TGC TGG CTG TCC AGC CCG CGC GGG TGG ATG TCG 
Trp Gly Glu Asn Asp Thr Asp Val Phe Val Leu

1585 TGG GGT GAA AAT GAT ACG GAC GTC TTC GTC CTT 
ACC CCA CTT TTA CTA TGC CTG CAG AAG CAG GAA 



Asn Asn Thr Arg Pro Pro Leu Gly Asn Trp Phe 

1618 AAC AAT ACC AGG CCA CCG CTG GGC AAT TGG TTC 
TTG TTA TGG TCC GGT GGC GAC CCG TTA ACC AAG 



Gly Cys Thr Trp Met Asn Ser Thr Gly Phe Thr 

1651 GGT TGT ACC TGG ATG AAC TCA ACT GGA TTC ACC 
CCA ACA TGG ACC TAC TTG AGT TGA CCT AAG TGG 



Lys Val Cys Gly Ala Pro Pro Cys Val Ile Gly

1684 AAA GTG TGC GGA GCG CCT CCT TGT GTC ATC GGA 
TTT CAC ACG CCT CGC GGA GGA ACA CAG TAG CCT 



Gly Ala Gly Asn Asn Thr Leu His Cys Pro Thr

1717 GGG GCG GGC AAC AAC ACC CTG CAC TGC CCC ACT 
CCC CGC CCG TTG TTG TGG GAC GTG ACG GGG TGA 



Asp Cys Phe Arg Lys His Pro Asp Ala Thr Tyr 

1750 GAT TGC TTC CGC AAG CAT CCG GAC GCC ACA TAC
CTA ACG AAG GCG TTC GTA GGC CTG CGG TGT ATG 



Ser Arg Cys Gly Ser Gly Pro Trp Ile Thr Pro

1783 TCT CGG TGC GGC TCC GGT CCC TGG ATC ACA CCC 
AGA GCC ACG CCG AGG CCA GGG ACC TAG TGT GGG 



Arg Cys Leu Val Asp Tyr Pro Tyr Arg Leu Trp 

1816 AGG TGC CTG GTC GAC TAC CCG TAT AGG CTT TGG 
TCC ACG GAC CAG CTG ATG GGC ATA TCC GAA ACC 



His Tyr Pro Cys Thr Ile Asn Tyr Thr Ile Phe

1849 CAT TAT CCT TGT ACC ATC AAC TAC ACC ATA TTT 
GTA ATA GGA ACA TGG TAG TTG ATG TGG TAT AAA
Lys Ile Arg Met Tyr Val Gly Gly Val Glu His

1882 AAA ATC AGG ATG TAC GTG GGA GGG GTC GAA CAC 
TTT TAG TCC TAC ATG CAC CCT CCC CAG CTT GTG 



Arg Leu 

1915 AGG CTG 
TCC GAC 



Glu Ala Ala 

GAA GCT GCC 
CTT CGA CGG 



Cys Asn Trp 

TGC AAC TGG 
ACG TTG ACC 



Thr Arg Gly 

ACG CGG GGC 
TGC GCC CCG 



Glu Arg Cys Asp Leu Glu Asp Arg Asp Arg Ser 

1948 GAA CGT TGC GAT CTG GAA GAC AGG GAC AGG TCC 
CTT GCA ACG CTA GAC CTT CTG TCC CTG TCC AGG 



Glu Leu Ser Pro Leu Leu Leu Thr Thr Thr Gln

1981 GAG CTC AGC CCG TTA CTG CTG ACC ACT ACA CAG 
CTC GAG TCG GGC AAT GAC GAC TGG TGA TGT GTC 



Trp Gln Val Leu Pro Cys Ser Phe Thr Thr Leu
2014 TGG CAG GTC CTC CCG TGT TCC TTC ACA ACC CTA 
ACC GTC CAG GAG GGC ACA AGG AAG TGT TGG GAT 



Pro Ala Leu Ser Thr Gly Leu Ile His Leu His

2047 CCA GCC TTG TCC ACC GGC CTC ATC CAC CTC CAC
GGT CGG AAC AGG TGG CCG GAG TAG GTG GAG GTG 



Gln Asn Ile Val Asp Val Gln Tyr Leu Tyr Gly
2080 CAG AAC ATT GTG GAC GTG CAG TAC TTG TAC GGG 
GTC TTG TAA CAC CTG CAC GTC ATG AAC ATG CCC 



Val Gly Ser Ser Ile Ala Ser Trp Ala Ile Lys

2113 GTG GGG TCA AGC ATC GCG TCC TGG GCC ATT AAG 
CAC CCC AGT TCG TAG CGC AGG ACC CGG TAA TTC 



Trp Glu Tyr Val Val Leu Leu Phe Leu Leu Leu 
2146 TGG GAG TAC GTC GTT CTC CTG TTC CTT CTG CTT 
ACC CTC ATG CAG CAA GAG GAC AAG GAA GAC GAA 
Ala Asp Ala
2179 GCA GAC GCG 
CGT CTG CGC 



Arg Val Cys Ser 

CGC GTC TGC TCC 
GCG CAG ACG AGG 



Cys Leu Trp Met 
TGC TTG TGG ATG 
ACG AAC ACC TAC 



Met Leu Leu Ile Ser Gln Ala Glu Ala Ala Leu
2212 ATG CTA CTC ATA TCC CAA GCG GAG GCG GCT TTG 
TAC GAT GAG TAT AGG GTT CGC CTC CGC CGA AAC 



Glu Asn Leu Val Ile Leu Asn Ala Ala Ser Leu

2245 GAG AAC CTC GTA ATA CTT AAT GCA GCA TCC CTG 
CTC TTG GAG CAT TAT GAA TTA CGT CGT AGG GAC 



Ala Gly Thr His Gly Leu Val Ser Phe Leu Val 

2278 GCC GGG ACG CAC GGT CTT GTA TCC TTC CTC GTG 
CGG CCC TGC GTG CCA GAA CAT AGG AAG GAG CAC 



Phe Phe Cys Phe Ala Trp Tyr Leu Lys Gly Lys 

2311 TTC TTC TGC TTT GCA TGG TAT TTG AAG GGT AAG 
AAG AAG ACG AAA CGT ACC ATA AAC TTC CCA TTC 



Trp Val Pro Gly Ala Val Tyr Thr Phe Tyr Gly 

2344 TGG GTG CCC GGA GCG GTC TAC ACC TTC TAC GGG 
ACC CAC GGG CCT CGC CAG ATG TGG AAG ATG CCC 



Met Trp Pro Leu Leu Leu Leu Leu Leu Ala Leu 

2377 ATG TGG CCT CTC CTC CTG CTC CTG TTG GCG TTG 
TAC ACC GGA GAG GAG GAC GAG GAC AAC CGC AAC 



Pro Gln Arg Ala Tyr Ala Leu Asp Thr Glu Val
2410 CCC CAG CGG GCG TAC GCG CTG GAC ACG GAG GTG 
GGG GTC GCC CGC ATG CGC GAC CTG TGC CTC CAC 



Ala Ala Ser Cys Gly Gly Val Val Leu Val Gly 

2443 GCC GCG TCG TGT GGC GGT GTT GTT CTC GTC GGG 
CGG CGC AGC ACA CCG CCA CAA CAA GAG CAG CCC 

FIG. 12-9 
Leu Met Ala Leu Thr Leu Ser Pro Tyr Tyr Lys

2476 TTG ATG GCG CTG ACT CTG TCA CCA TAT TAC AAG 
AAC TAC CGC GAC TGA GAC AGT GGT ATA ATG TTC 



Arg Tyr Ile Ser Trp Cys Leu Trp Trp Leu Gln

2509 CGC TAT ATC AGC TGG TGC TTG TGG TGG CTT CAG 
GCG ATA TAG TCG ACC ACG AAC ACC ACC GAA GTC 



Tyr Phe Leu Thr Arg Val Glu Ala Gln Leu His
2542 TAT TTT CTG ACC AGA GTG GAA GCG CAA CTG CAC 
ATA AAA GAC TGG TCT CAC CTT CGC GTT GAC GTG 



Val Trp Ile

2575 GTG TGG ATT 
CAC ACC TAA 



Pro Pro Leu Asn 

CCC CCC CTC AAC 
GGG GGG GAG TTG 



Val Arg Gly Gly 

GTC CGA GGG GGG 
CAG GCT CCC CCC 



Arg Asp Ala Val Ile Leu Leu Met Cys Ala Val

2608 CGC GAC GCC GTC ATC TTA CTC ATG TGT GCT GTA 
GCG CTG CGG CAG TAG AAT GAG TAC ACA CGA CAT 



His Pro Thr Leu Val Phe Asp Ile Thr Lys Leu

2641 CAC CCG ACT CTG GTA TTT GAC ATC ACC AAA TTG
GTG GGC TGA GAC CAT AAA CTG TAG TGG TTT AAC 



Leu Leu Ala 
2674 CTG CTG GCC 
GAC GAC CGG 



Val Phe Gly Pro 
GTC TTC GGA CCC 
CAG AAG CCT GGG 



Leu Trp Ile Leu

CTT TGG ATT CTT 
GAA ACC TAA GAA 



Gln Ala Ser Leu Leu Lys Val Pro Tyr Phe Val

2707 CAA GCC AGT TTG CTT AAA GTA CCC TAC TTT GTG 
GTT CGG TCA AAC GAA TTT CAT GGG ATG AAA CAC 



Arg Val Gln Gly Leu Leu Arg Phe Cys Ala Leu
2740 CGC GTC CAA GGC CTT CTC CGG TTC TGC GCG TTA
GCG CAG GTT CCG GAA GAG GCC AAG ACG CGC AAT
Ala Arg Lys Met Ile Gly Gly His Tyr Val Gln

2773 GCG CGG AAG ATG ATC GGA GGC CAT TAC GTG CAA 
CGC GCC TTC TAC TAG CCT CCG GTA ATG CAC GTT 



Met Val Ile Ile Lys Leu Gly Ala Leu Thr Gly

2806 ATG GTC ATC ATT AAG TTA GGG GCG CTT ACT GGC 
TAC CAG TAG TAA TTC AAT CCC CGC GAA TGA CCG 



Thr Tyr Val Tyr Asn His Leu Thr Pro Leu Arg 

2839 ACC TAT GTT TAT AAC CAT CTC ACT CCT CTT CGG 
TGG ATA CAA ATA TTG GTA GAG TGA GGA GAA GCC 



Asp Trp Ala His Asn Gly Leu Arg Asp Leu Ala 

2872 GAC TGG GCG CAC AAC GGC TTG CGA GAT CTG GCC 
CTG ACC CGC GTG TTG CCG AAC GCT CTA GAC CGG 



Val Ala Val Glu Pro Val Val Phe Ser Gln Met

2905 GTG GCT GTA GAG CCA GTC GTC TTC TCC CAA ATG 
CAC CGA CAT CTC GGT CAG CAG AAG AGG GTT TAC 



Glu Thr Lys Leu Ile Thr Trp Gly Ala Asp Thr
2938 GAG ACC AAG CTC ATC ACG TGG GGG GCA GAT ACC 
CTC TGG TTC GAG TAG TGC ACC CCC CGT CTA TGG 



Ala Ala Cys Gly Asp Ile Ile Asn Gly Leu Pro

2971 GCC GCG TGC GGT GAC ATC ATC AAC GGC TTG CCT 
CGG CGC ACG CCA CTG TAG TAG TTG CCG AAC GGA 



Val Ser Ala Arg Arg Gly Arg Glu Ile Leu Leu

3004 GTT TCC GCC CGC AGG GGC CGG GAG ATA CTG CTC 
CAA AGG CGG GCG TCC CCG GCC CTC TAT GAC GAG 



Gly Pro Ala Asp Gly Met Val Ser Lys Gly Trp 

3037 GGG CCA GCC GAT GGA ATG GTC TCC AAG GGG TGG 
CCC GGT CGG CTA CCT TAC CAG AGG TTC CCC ACC 

FIG. 12-11 
Arg Leu Leu Ala Pro Ile Thr Ala Tyr Ala Gln

3070 AGG TTG CTG GCG CCC ATC ACG GCG TAC GCC CAG 
TCC AAC GAC CGC GGG TAG TGC CGC ATG CGG GTC 



Gly Leu Leu Gly Cys Ile Ile Thr

GGC CTC CTA GGG TGC ATA ATC ACC 
CCG GAG GAT CCC ACG TAT TAG TGG 



Gly Arg Asp Lys Asn Gln Val Glu

GGC CGG GAC AAA AAC CAA GTG GAG 
CCG GCC CTG TTT TTG GTT CAC CTC 



Gln Ile Val Ser Thr Ala Ala Gln
CAG ATT GTG TCA ACT GCT GCC CAA 
GTC TAA CAC AGT TGA CGA CGG GTT 



Gln Thr Arg

3103 CAG ACA AGG 
GTC TGT TCC 



Ser Leu Thr 

3136 AGC CTA ACT 
TCG GAT TGA 



Gly Glu Val
3169 GGT GAG GTC 
CCA CTC CAG 



Thr Phe Leu 

3202 ACC TTC CTG 
TGG AAG GAC 



Ala Thr Cys Ile

GCA ACG TGC ATC 
CGT TGC ACG TAG 



Asn Gly Val Cys 

AAT GGG GTG TGC 
TTA CCC CAC ACG 



Trp Thr Val Tyr His Gly Ala Gly Thr Arg Thr 

3235 TGG ACT GTC TAC CAC GGG GCC GGA ACG AGG ACC 
ACC TGA CAG ATG GTG CCC CGG CCT TGC TCC TGG 



Ile Ala Ser Pro Lys Gly Pro Val Ile Gln Met

3268 ATC GCG TCA CCC AAG GGT CCT GTC ATC CAG ATG 
TAG CGC AGT GGG TTC CCA GGA CAG TAG GTC TAC 



Tyr Thr Asn 

3301 TAT ACC AAT 
ATA TGG TTA 



Val Asp Gln Asp

GTA GAC CAA GAC 
CAT CTG GTT CTG 



Leu Val Gly Trp 

CTT GTG GGC TGG 
GAA CAC CCG ACC 



Pro Ala Pro 

3334 CCC GCT CCG 
GGG CGA GGC 



Gln Gly Ser Arg

CAA GGT AGC CGC 
GTT CCA TCG GCG 

FIG. 12-12 



Ser Leu Thr Pro 
TCA TTG ACA CCC 
AGT AAC TGT GGG 
Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu Val

3367 TGC ACT TGC GGC TCC TCG GAC CTT TAC CTG GTC 
ACG TGA ACG CCG AGG AGC CTG GAA ATG GAC CAG 



Thr Arg His Ala Asp Val Ile Pro Val Arg Arg

3400 ACG AGG CAC GCC GAT GTC ATT CCC GTG CGC CGG 
TGC TCC GTG CGG CTA CAG TAA GGG CAC GCG GCC 



Arg Gly Asp Ser Arg Gly Ser Leu Leu Ser Pro 
3433 CGG GGT GAT AGC AGG GGC AGC CTG CTG TCG CCC 
GCC CCA CTA TCG TCC CCG TCG GAC GAC AGC GGG 



Arg Pro Ile Ser Tyr Leu Lys Gly Ser Ser Gly

3466 CGG CCC ATT TCC TAC TTG AAA GGC TCC TCG GGG 
GCC GGG TAA AGG ATG AAC TTT CCG AGG AGC CCC 



Gly Pro Leu Leu Cys Pro Ala Gly His Ala Val 
3499 GGT CCG CTG TTG TGC CCC GCG GGG CAC GCC GTG 
CCA GGC GAC AAC ACG GGG CGC CCC GTG CGG CAC 



Gly Ile Phe Arg Ala Ala Val Cys Thr Arg Gly

3532 GGC ATA TTT AGG GCC GCG GTG TGC ACC CGT GGA 
CCG TAT AAA TCC CGG CGC CAC ACG TGG GCA CCT 



Val Ala Lys Ala Val Asp Phe Ile Pro Val Glu
3565 GTG GCT AAG GCG GTG GAC TTT ATC CCT GTG GAG 
CAC CGA TTC CGC CAC CTG AAA TAG GGA CAC CTC 



Asn Leu Glu Thr Thr Met Arg Ser Pro Val Phe 

3598 AAC CTA GAG ACA ACC ATG AGG TCC CCG GTG TTC 
TTG GAT CTC TGT TGG TAC TCC AGG GGC CAC AAG 



Thr Asp Asn Ser Ser Pro Pro Val Val Pro Gln

3631 ACG GAT AAC TCC TCT CCA CCA GTA GTG CCC CAG 
TGC CTA TTG AGG AGA GGT GGT CAT CAC GGG GTC 

FIG. 12-13 
Ser Phe Gln Val Ala His Leu His Ala Pro Thr

3664 AGC TTC CAG GTG GCT CAC CTC CAT GCT CCC ACA 
TCG AAG GTC CAC CGA GTG GAG GTA CGA GGG TGT 



Gly Ser Gly Lys Ser Thr Lys Val Pro Ala Ala 

3697 GGC AGC GGC AAA AGC ACC AAG GTC CCG GCT GCA 
CCG TCG CCG TTT TCG TGG TTC CAG GGC CGA CGT 



Tyr Ala Ala Gln Gly Tyr Lys Val Leu Val Leu

3730 TAT GCA GCT CAG GGC TAT AAG GTG CTA GTA CTC 
ATA CGT CGA GTC CCG ATA TTC CAC GAT CAT GAG 



Asn Pro Ser Val Ala Ala Thr Leu Gly Phe Gly 

3763 AAC CCC TCT GTT GCT GCA ACA CTG GGC TTT GGT 
TTG GGG AGA CAA CGA CGT TGT GAC CCG AAA CCA 



Ala Tyr Met Ser Lys Ala His Gly Ile Asp Pro

3796 GCT TAC ATG TCC AAG GCT CAT GGG ATC GAT CCT 
CGA ATG TAC AGG TTC CGA GTA CCC TAG CTA GGA 



Asn Ile Arg Thr Gly Val Arg Thr Ile Thr Thr

3829 AAC ATC AGG ACC GGG GTG AGA ACA ATT ACC ACT 
TTG TAG TCC TGG CCC CAC TCT TGT TAA TGG TGA 



Gly Ser Pro Ile Thr Tyr Ser Thr Tyr Gly Lys

3862 GGC AGC CCC ATC ACG TAC TCC ACC TAC GGC AAG 
CCG TCG GGG TAG TGC ATG AGG TGG ATG CCG TTC 



Phe Leu Ala Asp Gly Gly Cys Ser Gly Gly Ala 

3895 TTC CTT GCC GAC GGC GGG TGC TCG GGG GGC GCT 
AAG GAA CGG CTG CCG CCC ACG AGC CCC CCG CGA 



Tyr Asp Ile Ile Ile Cys Asp Glu Cys His Ser
3928 TAT GAC ATA ATA ATT TGT GAC GAG TGC CAC TCC 
ATA CTG TAT TAT TAA ACA CTG CTC ACG GTG AGG 

FIG. 12-14 
Thr Asp Ala Thr Ser Ile Leu Gly Ile Gly Thr

3961 ACG GAT GCC ACA TCC ATC TTG GGC ATC GGC ACT 
TGC CTA CGG TGT AGG TAG AAC CCG TAG CCG TGA 



Val Leu Asp Gln Ala Glu Thr Ala Gly Ala Arg

3994 GTC CTT GAC CAA GCA GAG ACT GCG GGG GCG AGA 
CAG GAA CTG GTT CGT CTC TGA CGC CCC CGC TCT 



Leu Val Val Leu Ala Thr Ala Thr Pro Pro Gly

4027 CTG GTT GTG CTC GCC ACC GCC ACC CCT CCG GGC 
GAC CAA CAC GAG CGG TGG CGG TGG GGA GGC CCG 



Ser Val Thr Val Pro His Pro Asn Ile Glu Glu

4060 TCC GTC ACT GTG CCC CAT CCC AAC ATC GAG GAG 
AGG CAG TGA CAC GGG GTA GGG TTG TAG CTC CTC 



Val Ala Leu Ser Thr Thr Gly Glu Ile Pro Phe

4093 GTT GCT CTG TCC ACC ACC GGA GAG ATC CCT TTT 
CAA CGA GAC AGG TGG TGG CCT CTC TAG GGA AAA 

Tyr Gly Lys Ala Ile Pro Leu Glu Val Ile Lys

4126 TAC GGC AAG GCT ATC CCC CTC GAA GTA ATC AAG 
ATG CCG TTC CGA TAG GGG GAG CTT CAT TAG TTC 



Gly Gly Arg His Leu Ile Phe Cys His Ser Lys

4159 GGG GGG AGA CAT CTC ATC TTC TGT CAT TCA AAG 
CCC CCC TCT GTA GAG TAG AAG ACA GTA AGT TTC 



Lys Lys Cys 

4192 AAG AAG TGC
TTC TTC ACG 



Asp Glu Leu Ala 
GAC GAA CTC GCC 
CTG CTT GAG CGG 



Ala Lys Leu Val 

GCA AAG CTG GTC 
CGT TTC GAC CAG 



Ala Leu Gly 
4225 GCA TTG GGC 
CGT AAC CCG 



Ile Asn Ala Val
ATC AAT GCC GTG 
TAG TTA CGG CAC 
GCC TAC TAC CGC
CGG ATG ATG GCG 



Gly Leu Asp Val Ser Val Ile Pro Thr Ser Gly

4258 GGT CTT GAC GTG TCC GTC ATC CCG ACC AGC GGC 
CCA GAA CTG CAC AGG CAG TAG GGC TGG TCG CCG 



Asp Val Val Val Val Ala Thr Asp Ala Leu Met 

4291 GAT GTT GTC GTC GTG GCA ACC GAT GCC CTC ATG 
CTA CAA CAG CAG CAC CGT TGG CTA CGG GAG TAC 



Thr Gly Tyr Thr Gly Asp Phe Asp Ser Val Ile

4324 ACC GGC TAT ACC GGC GAC TTC GAC TCG GTG ATA 
TGG CCG ATA TGG CCG CTG AAG CTG AGC CAC TAT 



Asp Cys Asn Thr Cys Val Thr Gln Thr Val Asp
4357 GAC TGC AAT ACG TGT GTC ACC CAG ACA GTC GAT 
CTG ACG TTA TGC ACA CAG TGG GTC TGT CAG CTA 



Phe Ser Leu Asp Pro Thr Phe Thr Ile Glu Thr
4390 TTC AGC CTT GAC CCT ACC TTC ACC ATT GAG ACA 
AAG TCG GAA CTG GGA TGG AAG TGG TAA CTC TGT 



Ile Thr Leu Pro Gln Asp Ala Val Ser Arg Thr

4423 ATC ACG CTC CCC CAG GAT GCT GTC TCC CGC ACT
TAG TGC GAG GGG GTC CTA CGA CAG AGG GCG TGA 



Gln Arg Arg Gly Arg Thr Gly Arg Gly Lys Pro

4456 CAA CGT CGG GGC AGG ACT GGC AGG GGG AAG CCA 
GTT GCA GCC CCG TCC TGA CCG TCC CCC TTC GGT 



Gly Ile Tyr Arg Phe Val Ala Pro Gly Glu Arg

4489 GGC ATC TAC AGA TTT GTG GCA CCG GGG GAG CGC 
CCG TAG ATG TCT AAA CAC CGT GGC CCC CTC GCG 



Pro Ser Gly Met Phe Asp Ser Ser Val Leu Cys 
4522 CCC TCC GGC ATG TTC GAC TCG TCC GTC CTC TGT 
GGG AGG CCG TAC AAG CTG AGC AGG CAG GAG ACA
Glu Cys Tyr

4555 GAG TGC TAT 
CTC ACG ATA 
Asp Ala Gly Cys
GAC GCA GGC TGT 
CTG CGT CCG ACA 



Ala Trp Tyr Glu 
GCT TGG TAT GAG 
CGA ACC ATA CTC 



Leu Thr Pro Ala Glu Thr Thr Val Arg Leu Arg 

4588 CTC ACG CCC GCC GAG ACT ACA GTT AGG CTA CGA 
GAG TGC GGG CGG CTC TGA TGT CAA TCC GAT GCT 



Ala Tyr Met Asn Thr Pro Gly Leu Pro Val Cys 
4621 GCG TAC ATG AAC ACC CCG GGG CTT CCC GTG TGC 
CGC ATG TAC TTG TGG GGC CCC GAA GGG CAC ACG 



Gln Asp His Leu Glu Phe Trp Glu Gly Val Phe

4654 CAG GAC CAT CTT GAA TTT TGG GAG GGC GTC TTT 
GTC CTG GTA GAA CTT AAA ACC CTC CCG CAG AAA 



Thr Gly Leu 

4687 ACA GGC CTC 
TGT CCG GAG 



Thr His Ile Asp

ACT CAT ATA GAT 
TGA GTA TAT CTA 



Ala His Phe Leu 

GCC CAC TTT CTA 
CGG GTG AAA GAT 



Ser Gln Thr Lys Gln Ser Gly Glu Asn Leu Pro

4720 TCC CAG ACA AAG CAG AGT GGG GAG AAC CTT CCT 
AGG GTC TGT TTC GTC TCA CCC CTC TTG GAA GGA 



Tyr Leu Val 
4753 TAC CTG GTA 
ATG GAC CAT 



Ala Tyr Gln Ala

GCG TAC CAA GCC 
CGC ATG GTT CGG 



Thr Val Cys Ala 

ACC GTG TGC GCT 
TGG CAC ACG CGA 



Arg Ala Gln Ala Pro Pro Pro Ser Trp Asp Gln

4786 AGG GCT CAA GCC CCT CCC CCA TCG TGG GAC CAG 
TCC CGA GTT CGG GGA GGG GGT AGC ACC CTG GTC 



Met Trp Lys 

4819 ATG TGG AAG 
TAC ACC TTC 



Cys Leu Ile Arg

TGT TTG ATT CGC 
ACA AAC TAA GCG 
Leu Lys Pro Thr

CTC AAG CCC ACC 
GAG TTC GGG TGG 
Leu His Gly
4852 CTC CAT GGG 
GAG GTA CCC 
Pro Thr Pro Leu
CCA ACA CCC CTG 
GGT TGT GGG GAC 



Leu Tyr Arg Leu 
CTA TAC AGA CTG 
GAT ATG TCT GAC 



Gly Ala Val 
4885 GGC GCT GTT 
CCG CGA CAA 



Gln Asn Glu Ile
CAG AAT GAA ATC 
GTC TTA CTT TAG 



Thr Leu Thr His 
ACC CTG ACG CAC 
TGG GAC TGC GTG 



Pro Val Thr Lys Tyr Ile Met Thr Cys Met Ser
4918 CCA GTC ACC AAA TAC ATC ATG ACA TGC ATG TCG 
GGT CAG TGG TTT ATG TAG TAC TGT ACG TAC AGC 



Ala Asp Leu Glu Val Val Thr Ser Thr Trp Val 
4951 GCC GAC CTG GAG GTC GTC ACG AGC ACC TGG GTG 
CGG CTG GAC CTC CAG CAG TGC TCG TGG ACC CAC 



Leu Val Gly Gly Val Leu Ala Ala Leu Ala Ala 

4984 CTC GTT GGC GGC GTC CTG GCT GCT TTG GCC GCG 
GAG CAA CCG CCG CAG GAC CGA CGA AAC CGG CGC 



Tyr Cys Leu Ser Thr Gly Cys Val Val Ile Val
5017 TAT TGC CTG TCA ACA GGC TGC GTG GTC ATA GTG 
ATA ACG GAC AGT TGT CCG ACG CAC CAG TAT CAC 



Gly Arg Val Val Leu Ser Gly Lys Pro Ala Ile
5050 GGC AGG GTC GTC TTG TCC GGG AAG CCG GCA ATC 
CCG TCC CAG CAG AAC AGG CCC TTC GGC CGT TAG 



Ile Pro Asp Arg Glu Val Leu Tyr Arg Glu Phe
5083 ATA CCT GAC AGG GAA GTC CTC TAC CGA GAG TTC 
TAT GGA CTG TCC CTT CAG GAG ATG GCT CTC AAG 



Asp Glu Met 
5116 GAT GAG ATG 
CTA CTC TAC 



Glu Glu Cys Ser 

GAA GAG TGC TCT 
CTT CTC ACG AGA 
Gln His Leu Pro
CAG CAC TTA CCG
GTC GTG AAT GGC
Tyr Ile Glu Gln Gly Met Met Leu Ala Glu Gln

5149 TAC ATC GAG CAA GGG ATG ATG CTC GCC GAG CAG 
ATG TAG CTC GTT CCC TAC TAC GAG CGG CTC GTC 



Phe Lys Gln Lys Ala

5182 TTC AAG CAG AAG GCC 
AAG TTC GTC TTC CGG 



Ala Ser Arg Gln Ala

5215 GCG TCC CGT CAG GCA 
CGC AGG GCA GTC CGT 



Val Gln Thr Asn Trp
5248 GTC CAG ACC AAC TGG 
CAG GTC TGG TTG ACC 



5281 



Ile Gln Tyr Leu Ala

5314 ATA CAA TAC TTG GCG 
TAT GTT ATG AAC CGC 



Gly Asn 

5347 GGT AAC 
CCA TTG 



Thr Ala 

5380 ACA GCT 
TGT CGA 



Gln Thr Leu Leu Phe

5413 CAA ACC CTC CTC TTC 
GTT TGG GAG GAG AAG 

FIG. 



Leu Gly Leu Leu Gln Thr

CTC GGC CTC CTG CAG ACC 
GAG CCG GAG GAC GTC TGG 



Glu Val Ile Ala Pro Ala

GAG GTT ATC GCC CCT GCT 
CTC CAA TAG CGG GGA CGA 



Gln Lys Leu Glu Thr Phe
CAA AAA CTC GAG ACC TTC 
GTT TTT GAG CTC TGG AAG 



Gly Leu Ser Thr Leu Pro 

GGC TTG TCA ACG CTG CCT 
CCG AAC AGT TGC GAC GGA 



Asn Ile Leu Gly Gly Trp

AAC ATA TTG GGG GGG TGG 
TTG TAT AAC CCC CCC ACC 

12-19 



Trp Ala Lys His Met Trp Asn Phe Ile Ser Gly

TGG GCG AAG CAT ATG TGG AAC TTC ATC AGT GGG 
ACC CGC TTC GTA TAC ACC TTG AAG TAG TCA CCC 



Pro Ala Ile Ala Ser Leu Met Ala Phe

CCC GCC ATT GCT TCA TTG ATG GCT TTT 
GGG CGG TAA CGA AGT AAC TAC CGA AAA 



Ala Val Thr Ser Pro Leu Thr Thr Ser 

GCT GTC ACC AGC CCA CTA ACC ACT AGC 
CGA CAG TGG TCG GGT GAT TGG TGA TCG 
Val Ala Ala Gln Leu Ala Ala Pro Gly Ala Ala

5446 GTG GCT GCC CAG CTC GCC GCC CCC GGT GCC GCT 
CAC CGA CGG GTC GAG CGG CGG GGG CCA CGG CGA 



Thr Ala Phe Val Gly Ala Gly Leu Ala Gly Ala 
5479 ACT GCC TTT GTG GGC GCT GGC TTA GCT GGC GCC 
TGA CGG AAA CAC CCG CGA CCG AAT CGA CCG CGG 



Ala Ile Gly Ser Val Gly Leu Gly Lys Val Leu
5512 GCC ATC GGC AGT GTT GGA CTG GGG AAG GTC CTC 
CGG TAG CCG TCA CAA CCT GAC CCC TTC CAG GAG 



Ile Asp Ile Leu Ala Gly Tyr Gly Ala Gly Val

5545 ATA GAC ATC CTT GCA GGG TAT GGC GCG GGC GTG 
TAT CTG TAG GAA CGT CCC ATA CCG CGC CCG CAC 



Ala Gly Ala Leu Val Ala Phe Lys Ile Met Ser

5578 GCG GGA GCT CTT GTG GCA TTC AAG ATC ATG AGC 
CGC CCT CGA GAA CAC CGT AAG TTC TAG TAC TCG 



Gly Glu Val Pro Ser Thr Glu Asp Leu Val Asn 

5611 GGT GAG GTC CCC TCC ACG GAG GAC CTG GTC AAT 
CCA CTC CAG GGG AGG TGC CTC CTG GAC CAG TTA 



Leu Leu Pro Ala Ile Leu Ser Pro Gly Ala Leu

5644 CTA CTG CCC GCC ATC CTC TCG CCC GGA GCC CTC 
GAT GAC GGG CGG TAG GAG AGC GGG CCT CGG GAG 



Val Val Gly Val Val Cys Ala Ala Ile Leu Arg

5677 GTA GTC GGC GTG GTC TGT GCA GCA ATA CTG CGC 
CAT CAG CCG CAC CAG ACA CGT CGT TAT GAC GCG 



Arg His Val Gly Pro Gly Glu Gly Ala Val Gln

5710 CGG CAC GTT GGC CCG GGC GAG GGG GCA GTG CAG 
GCC GTG CAA CCG GGC CCG CTC CCC CGT CAC GTC 
Trp Met Asn Arg Leu Ile Ala Phe Ala Ser Arg

5743 TGG ATG AAC CGG CTG ATA GCC TTC GCC TCC CGG 
ACC TAC TTG GCC GAC TAT CGG AAG CGG AGG GCC 



Gly Asn His Val Ser 
5776 GGG AAC CAT GTT TCC 
CCC TTG GTA CAA AGG 



Pro Thr His Tyr Val Pro 

CCC ACG CAC TAC GTG CCG 
GGG TGC GTG ATG CAC GGC 



Glu Ser Asp Ala Ala Ala Arg Val Thr Ala Ile

5809 GAG AGC GAT GCA GCT GCC CGC GTC ACT GCC ATA 
CTC TCG CTA CGT CGA CGG GCG CAG TGA CGG TAT 



Leu Ser Ser Leu Thr 

5842 CTC AGC AGC CTC ACT 
GAG TCG TCG GAG TGA 

Arg Leu His Gln Trp

5875 CGA CTG CAC CAG TGG 
GCT GAC GTG GTC ACC 



Thr Pro Cys Ser Gly 

5908 ACT CCA TGC TCC GGT 
TGA GGT ACG AGG CCA 



Trp Asp Trp Ile Cys
5941 TGG GAC TGG ATA TGC 
ACC CTG ACC TAT ACG 



Val Thr Gln Leu Leu Arg
GTA ACC CAG CTC CTG AGG 
CAT TGG GTC GAG GAC TCC 



Ile Ser Ser Glu Cys Thr

ATA AGC TCG GAG TGT ACC 
TAT TCG AGC CTC ACA TGG 



Ser Trp Leu Arg Asp Ile

TCC TGG CTA AGG GAC ATC
AGG ACC GAT TCC CTG TAG 



Glu Val Leu Ser Asp Phe 
GAG GTG TTG AGC GAC TTT 
CTC CAC AAC TCG CTG AAA 



Lys Thr Trp Leu Lys Ala Lys Leu Met Pro Gln
5974 AAG ACC TGG CTA AAA GCT AAG CTC ATG CCA CAG 
TTC TGG ACC GAT TTT CGA TTC GAG TAC GGT GTC 



Leu Pro Gly Ile Pro Phe Val Ser Cys Gln Arg

6007 CTG CCT GGG ATC CCC TTT GTG TCC TGC CAG CGC
GAC GGA CCC TAG GGG AAA CAC AGG ACG GTC GCG

FIG. 12-21 
Gly Tyr Lys Gly Val Trp Arg Val Asp Gly Ile

6040 GGG TAT AAG GGG GTC TGG CGA GTG GAC GGC ATC 
CCC ATA TTC CCC CAG ACC GCT CAC CTG CCG TAG 



Met His Thr Arg Cys 

6073 ATG CAC ACT CGC TGC 
TAC GTG TGA GCG ACG 



His Cys Gly Ala Glu Ile

CAC TGT GGA GCT GAG ATC 
GTG ACA CCT CGA CTC TAG 



Thr Gly His Val Lys Asn Gly Thr Met Arg Ile

6106 ACT GGA CAT GTC AAA AAC GGG ACG ATG AGG ATC 
TGA CCT GTA CAG TTT TTG CCC TGC TAC TCC TAG 



Val Gly Pro Arg Thr Cys Arg Asn Met Trp Ser 

6139 GTC GGT CCT AGG ACC TGC AGG AAC ATG TGG AGT 
CAG CCA GGA TCC TGG ACG TCC TTG TAC ACC TCA 



Gly Thr Phe Pro Ile Asn Ala Tyr Thr Thr Gly

6172 GGG ACC TTC CCC ATT AAT GCC TAC ACC ACG GGC 
CCC TGG AAG GGG TAA TTA CGG ATG TGG TGC CCG 



Pro Cys Thr Pro Leu Pro Ala Pro Asn Tyr Thr 

6205 CCC TGT ACC CCC CTT CCT GCG CCG AAC TAC ACG 
GGG ACA TGG GGG GAA GGA CGC GGC TTG ATG TGC 



Phe Ala Leu Trp Arg Val Ser Ala Glu Glu Tyr 
6238 TTC GCG CTA TGG AGG GTG TCT GCA GAG GAA TAT 
AAG CGC GAT ACC TCC CAC AGA CGT CTC CTT ATA 



Val Glu Ile Arg Gln Val Gly Asp Phe His Tyr
6271 GTG GAG ATA AGG CAG GTG GGG GAC TTC CAC TAC 
CAC CTC TAT TCC GTC CAC CCC CTG AAG GTG ATG 



Val Thr Gly 

6304 GTG ACG GGT 
CAC TGC CCA 



Met Thr Thr Asp 
ATG ACT ACT GAC 
TAC TGA TGA CTG 
Asn Leu Lys Cys

AAT CTC AAA TGC 
TTA GAG TTT ACG 
Pro Cys Gln Val Pro Ser Pro Glu Phe Phe Thr
CCG TGC CAG GTC CCA TCG CCC GAA TTT TTC ACA 
GGC ACG GTC CAG GGT AGC GGG CTT AAA AAG TGT 



Glu Leu Asp Gly Val Arg Leu His Arg Phe Ala 

GAA TTG GAC GGG GTG CGC CTA CAT AGG TTT GCG 
CTT AAC CTG CCC CAC GCG GAT GTA TCC AAA CGC 



Pro Pro Cys Lys Pro Leu Leu Arg Glu Glu Val 

CCC CCC TGC AAG CCC TTG CTG CGG GAG GAG GTA 
GGG GGG ACG TTC GGG AAC GAC GCC CTC CTC CAT 



Ser Phe Arg Val Gly Leu His Glu Tyr Pro Val 

TCA TTC AGA GTA GGA CTC CAC GAA TAC CCG GTA 
AGT AAG TCT CAT CCT GAG GTG CTT ATG GGC CAT 



Gly Ser Gln Leu Pro Cys Glu Pro Glu Pro Asp
GGG TCG CAA TTA CCT TGC GAG CCC GAA CCG GAC 
CCC AGC GTT AAT GGA ACG CTC GGG CTT GGC CTG 



Val Ala Val Leu Thr Ser Met Leu Thr Asp Pro 
GTG GCC GTG TTG ACG TCC ATG CTC ACT GAT CCC 
CAC CGG CAC AAC TGC AGG TAC GAG TGA CTA GGG 



Ser His Ile Thr Ala Glu Ala Ala Gly Arg Arg
TCC CAT ATA ACA GCA GAG GCG GCC GGG CGA AGG 
AGG GTA TAT TGT CGT CTC CGC CGG CCC GCT TCC 



Leu Ala Arg Gly Ser Pro Pro Ser Val Ala Ser 

TTG GCG AGG GGA TCA CCC CCC TCT GTG GCC AGC 
AAC CGC TCC CCT AGT GGG GGG AGA CAC CGG TCG 



Ser Ser Ala Ser Gln Leu Ser Ala Pro Ser Leu

TCC TCG GCT AGC CAG CTA TCC GCT CCA TCT CTC 
AGG AGC CGA TCG GTC GAT AGG CGA GGT AGA GAG 



FIG. 
Lys Ala Thr Cys Thr Ala Asn His Asp Ser Pro

6634 AAG GCA ACT TGC ACC GCT AAC CAT GAC TCC CCT 

TTC CGT TGA ACG TGG CGA TTG GTA CTG AGG GGA 



Asp Ala Glu Leu Ile Glu Ala Asn Leu Leu Trp

6667 GAT GCT GAG CTC ATA GAG GCC AAC CTC CTA TGG 
CTA CGA CTC GAG TAT CTC CGG TTG GAG GAT ACC 



Arg Gln Glu Met Gly

6700 AGG CAG GAG ATG GGC 
TCC GTC CTC TAC CCG 



Glu Ser Glu Asn Lys 
6733 GAG TCA GAA AAC AAA 
CTC AGT CTT TTG TTT 

Phe Asp Pro Leu Val 
6766 TTC GAT CCG CTT GTG 
AAG CTA GGC GAA CAC 



Glu Ile Ser Val Pro
6799 GAG ATC TCC GTA CCC 
CTC TAG AGG CAT GGG 



Ser Arg Arg Phe Ala 
6832 TCT CGG AGA TTC GCC 
AGA GCC TCT AAG CGG 



Gly Asn Ile Thr Arg Val

GGC AAC ATC ACC AGG GTT 
CCG TTG TAG TGG TCC CAA 



Val Val Ile Leu Asp Ser

GTG GTG ATT CTG GAC TCC 
CAC CAC TAA GAC CTG AGG 



Ala Glu Glu Asp Glu Arg 

GCG GAG GAG GAC GAG CGG 
CGC CTC CTC CTG CTC GCC 



Ala Glu Ile Leu Arg Lys
GCA GAA ATC CTG CGG AAG 
CGT CTT TAG GAC GCC TTC 



Gln Ala Leu Pro Val Trp
CAG GCC CTG CCC GTT TGG 
GTC CGG GAC GGG CAA ACC 



Ala Arg Pro Asp Tyr Asn Pro Pro Leu Val Glu 
6865 GCG CGG CCG GAC TAT AAC CCC CCG CTA GTG GAG 
CGC GCC GGC CTG ATA TTG GGG GGC GAT CAC CTC 



Thr Trp Lys Lys Pro Asp Tyr Glu Pro Pro Val 
6898 ACG TGG AAA AAG CCC GAC TAC GAA CCA CCT GTG
TGC ACC TTT TTC GGG CTG ATG CTT GGT GGA CAC
Val His Gly Cys Pro Leu Pro Pro Pro Lys Ser

6931 GTC CAT GGC TGT CCG CTT CCA CCT CCA AAG TCC
CAG GTA CCG ACA GGC GAA GGT GGA GGT TTC AGG



Pro Pro Val Pro Pro Pro Arg Lys Lys Arg Thr 

6964 CCT CCT GTG CCT CCG CCT CGG AAG AAG CGG ACG 
GGA GGA CAC GGA GGC GGA GCC TTC TTC GCC TGC 



Val Val Leu Thr Glu Ser Thr Leu Ser Thr Ala 

6997 GTG GTC CTC ACT GAA TCA ACC CTA TCT ACT GCC 
CAC CAG GAG TGA CTT AGT TGG GAT AGA TGA CGG 



Leu Ala Glu Leu Ala Thr Arg Ser Phe Gly Ser 

7030 TTG GCC GAG CTC GCC ACC AGA AGC TTT GGC AGC 
AAC CGG CTC GAG CGG TGG TCT TCG AAA CCG TCG 



Ser Ser Thr Ser Gly Ile Thr Gly Asp Asn Thr

7063 TCC TCA ACT TCC GGC ATT ACG GGC GAC AAT ACG 
AGG AGT TGA AGG CCG TAA TGC CCG CTG TTA TGC 



Thr Thr Ser Ser Glu Pro Ala Pro Ser Gly Cys 
7096 ACA ACA TCC TCT GAG CCC GCC CCT TCT GGC TGC 
TGT TGT AGG AGA CTC GGG CGG GGA AGA CCG ACG 



Pro Pro Asp Ser Asp Ala Glu Ser Tyr Ser Ser 
7129 CCC CCC GAC TCC GAC GCT GAG TCC TAT TCC TCC 
GGG GGG CTG AGG CTG CGA CTC AGG ATA AGG AGG 



Met Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro 
7162 ATG CCC CCC CTG GAG GGG GAG CCT GGG GAT CCG 
TAC GGG GGG GAC CTC CCC CTC GGA CCC CTA GGC 



Asp Leu Ser Asp Gly Ser Trp Ser Thr Val Ser 

7195 GAT CTT AGC GAC GGG TCA TGG TCA ACG GTC AGT 
CTA GAA TCG CTG CCC AGT ACC AGT TGC CAG TCA 

FIG. 12-25 
Ser Glu Ala Asn Ala Glu Asp Val Val Cys Cys
7228 AGT GAG GCC AAC GCG GAG GAT GTC GTG TGC TGC 
TCA CTC CGG TTG CGC CTC CTA CAG CAC ACG ACG 



Ser Met Ser Tyr Ser Trp Thr Gly Ala Leu Val 

7261 TCA ATG TCT TAC TCT TGG ACA GGC GCA CTC GTC 
AGT TAC AGA ATG AGA ACC TGT CCG CGT GAG CAG 



Thr Pro Cys 

7294 ACC CCG TGC 
TGG GGC ACG 



Ala Ala Glu Glu
GCC GCG GAA GAA 
CGG CGC CTT CTT 



Gln Lys Leu Pro

CAG AAA CTG CCC 
GTC TTT GAC GGG 



Ile Asn Ala Leu Ser Asn Ser Leu Leu Arg His

7327 ATC AAT GCA CTA AGC AAC TCG TTG CTA CGT CAC 
TAG TTA CGT GAT TCG TTG AGC AAC GAT GCA GTG 

His Asn Leu Val Tyr Ser Thr Thr Ser Arg Ser 

7360 CAC AAT TTG GTG TAT TCC ACC ACC TCA CGC AGT 
GTG TTA AAC CAC ATA AGG TGG TGG AGT GCG TCA 

Ala Cys Gln Arg Gln Lys Lys Val Thr Phe Asp

7393 GCT TGC CAA AGG CAG AAG AAA GTC ACA TTT GAC 
CGA ACG GTT TCC GTC TTC TTT CAG TGT AAA CTG 



Arg Leu Gln Val Leu Asp Ser His Tyr Gln Asp

7426 AGA CTG CAA GTT CTG GAC AGC CAT TAC CAG GAC 
TCT GAC GTT CAA GAC CTG TCG GTA ATG GTC CTG 



Val Leu Lys Glu Val Lys Ala Ala Ala Ser Lys 

7459 GTA CTC AAG GAG GTT AAA GCA GCG GCG TCA AAA 
CAT GAG TTC CTC CAA TTT CGT CGC CGC AGT TTT 



Val Lys Ala Asn Leu Leu Ser Val Glu Glu Ala 

7492 GTG AAG GCT AAC TTG CTA TCC GTA GAG GAA GCT 
CAC TTC CGA TTG AAC GAT AGG CAT CTC CTT CGA 
Cys Ser Leu
7525 TGC AGC CTG 
ACG TCG GAC 



Thr Pro Pro His 

ACG CCC CCA CAC 
TGC GGG GGT GTG 



Ser Ala Lys Ser 

TCA GCC AAA TCC 
AGT CGG TTT AGG 



Lys Phe Gly Tyr Gly 

7558 AAG TTT GGT TAT GGG 
TTC AAA CCA ATA CCC 



His Ala Arg Lys Ala 

7591 CAT GCC AGA AAG GCC 
GTA CGG TCT TTC CGG 



Val Trp Lys Asp Leu 

7624 GTG TGG AAA GAC CTT 
CAC ACC TTT CTG GAA 



Pro Ile Asp Thr Thr

7657 CCA ATA GAC ACT ACC 
GGT TAT CTG TGA TGG 



Ala Lys Asp Val Arg Cys 

GCA AAA GAC GTC CGT TGC 
CGT TTT CTG CAG GCA ACG 



Val Thr His Ile Asn Ser
GTA ACC CAC ATC AAC TCC 
CAT TGG GTG TAG TTG AGG 



Leu Glu Asp Asn Val Thr 
CTG GAA GAC AAT GTA ACA 
GAC CTT CTG TTA CAT TGT 



Ile Met Ala Lys Asn Glu

ATC ATG GCT AAG AAC GAG 
TAG TAC CGA TTC TTG CTC 



Val Phe Cys Val Gln Pro Glu Lys Gly Gly Arg

7690 GTT TTC TGC GTT CAG CCT GAG AAG GGG GGT CGT 
CAA AAG ACG CAA GTC GGA CTC TTC CCC CCA GCA 



Lys Pro Ala Arg Leu Ile Val Phe Pro Asp Leu
7723 AAG CCA GCT CGT CTC ATC GTG TTC CCC GAT CTG 
TTC GGT CGA GCA GAG TAG CAC AAG GGG CTA GAC 



Gly Val Arg Val Cys Glu Lys Met Ala Leu Tyr 

7756 GGC GTG CGC GTG TGC GAA AAG ATG GCT TTG TAC 
CCG CAC GCG CAC ACG CTT TTC TAC CGA AAC ATG 



Asp Val Val Thr Lys Leu Pro Leu Ala Val Met 

7789 GAC GTG GTT ACA AAG CTC CCC TTG GCC GTG ATG
CTG CAC CAA TGT TTC GAG GGG AAC CGG CAC TAC

FIG. 12-27
Gly Ser Ser Tyr Gly Phe Gln Tyr Ser Pro Gly

GGA AGC TCC TAC GGA TTC CAA TAC TCA CCA GGA 
CCT TCG AGG ATG CCT AAG GTT ATG AGT GGT CCT 



Gln Arg Val Glu Phe Leu Val Gln Ala Trp Lys
CAG CGG GTT GAA TTC CTC GTG CAA GCG TGG AAG 
GTC GCC CAA CTT AAG GAG CAC GTT CGC ACC TTC 



Ser Lys Lys Thr Pro Met Gly Phe Ser Tyr Asp 

TCC AAG AAA ACC CCA ATG GGG TTC TCG TAT GAT 
AGG TTC TTT TGG GGT TAC CCC AAG AGC ATA CTA 



Thr Arg Cys Phe Asp Ser Thr Val Thr Glu Ser 

ACC CGC TGC TTT GAC TCC ACA GTC ACT GAG AGC 
TGG GCG ACG AAA CTG AGG TGT CAG TGA CTC TCG 



Asp Ile Arg Thr Glu Glu Ala Ile Tyr Gln Cys
GAC ATC CGT ACG GAG GAG GCA ATC TAC CAA TGT 
CTG TAG GCA TGC CTC CTC CGT TAG ATG GTT ACA 



Cys Asp Leu Asp Pro Gln Ala Arg Val Ala Ile

TGT GAC CTC GAC CCC CAA GCC CGC GTG GCC ATC 
ACA CTG GAG CTG GGG GTT CGG GCG CAC CGG TAG 



Lys Ser Leu Thr Glu Arg Leu Tyr Val Gly Gly 

AAG TCC CTC ACC GAG AGG CTT TAT GTT GGG GGC 
TTC AGG GAG TGG CTC TCC GAA ATA CAA CCC CCG 



Pro Leu Thr Asn Ser Arg Gly Glu Asn Cys Gly 

CCT CTT ACC AAT TCA AGG GGG GAG AAC TGC GGC 
GGA GAA TGG TTA AGT TCC CCC CTC TTG ACG CCG 



Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Thr 

TAT CGC AGG TGC CGC GCG AGC GGC GTA CTG ACA 
ATA GCG TCC ACG GCG CGC TCG CCG CAT GAC TGT 

FIG. 12-28 
Thr Ser Cys Gly Asn Thr Leu Thr Cys Tyr Ile

8119 ACT AGC TGT GGT AAC ACC CTC ACT TGC TAC ATC 
TGA TCG ACA CCA TTG TGG GAG TGA ACG ATG TAG 



Lys Ala Arg Ala Ala Cys Arg Ala Ala Gly Leu 

8152 AAG GCC CGG GCA GCC TGT CGA GCC GCA GGG CTC 
TTC CGG GCC CGT CGG ACA GCT CGG CGT CCC GAG 



Gln Asp Cys Thr Met Leu Val Cys Gly Asp Asp 
8185 CAG GAC TGC ACC ATG CTC GTG TGT GGC GAC GAC 
GTC CTG ACG TGG TAC GAG CAC ACA CCG CTG CTG 



Leu Val Val Ile Cys Glu Ser Ala Gly Val Gln 

8218 TTA GTC GTT ATC TGT GAA AGC GCG GGG GTC CAG 
AAT CAG CAA TAG ACA CTT TCG CGC CCC CAG GTC 



Glu Asp Ala Ala Ser Leu Arg Ala Phe Thr Glu 

8251 GAG GAC GCG GCG AGC CTG AGA GCC TTC ACG GAG 
CTC CTG CGC CGC TCG GAC TCT CGG AAG TGC CTC 



Ala Met Thr Arg Tyr Ser Ala Pro Pro Gly Asp 

8284 GCT ATG ACC AGG TAC TCC GCC CCC CCT GGG GAC 
CGA TAC TGG TCC ATG AGG CGG GGG GGA CCC CTG 



Pro Pro Gln Pro Glu Tyr Asp Leu Glu Leu Ile 
8317 CCC CCA CAA CCA GAA TAC GAC TTG GAG CTC ATA 
GGG GGT GTT GGT CTT ATG CTG AAC CTC GAG TAT 



Thr Ser Cys Ser Ser Asn Val Ser Val Ala His 

8350 ACA TCA TGC TCC TCC AAC GTG TCA GTC GCC CAC 
TGT AGT ACG AGG AGG TTG CAC AGT CAG CGG GTG 



Asp Gly Ala Gly Lys Arg Val Tyr Tyr Leu Thr 
8383 GAC GGC GCT GGA AAG AGG GTC TAC TAC CTC ACC 
CTG CCG CGA CCT TTC TCC CAG ATG ATG GAG TGG 

FIG. 12-29 
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Arg Asp Pro Thr Thr Pro Leu Ala Arg Ala Ala 

8416 CGT GAC CCT ACA ACC CCC CTC GCG AGA GCT GCG 

GCA CTG GGA TGT TGG GGG GAG CGC TCT CGA CGC 



Trp Glu Thr Ala Arg His Thr Pro Val Asn Ser 
8449 TGG GAG ACA GCA AGA CAC ACT CCA GTC AAT TCC 
ACC CTC TGT CGT TCT GTG TGA GGT CAG TTA AGG 



Trp Leu Gly Asn Ile Ile Met Phe Ala Pro Thr 

8482 TGG CTA GGC AAC ATA ATC ATG TTT GCC CCC ACA 
ACC GAT CCG TTG TAT TAG TAC AAA CGG GGG TGT 



Leu Trp Ala Arg Met Ile Leu Met Thr His Phe 

8515 CTG TGG GCG AGG ATG ATA CTG ATG ACC CAT TTC 
GAC ACC CGC TCC TAC TAT GAC TAC TGG GTA AAG 



Phe Ser Val Leu Ile Ala Arg Asp Gln Leu Glu 

8548 TTT AGC GTC CTT ATA GCC AGG GAC CAG CTT GAA 
AAA TCG CAG GAA TAT CGG TCC CTG GTC GAA CTT 



Gln Ala Leu Asp Cys Glu Ile Tyr Gly Ala Cys 

8581 CAG GCC CTC GAT TGC GAG ATC TAC GGG GCC TGC 
GTC CGG GAG CTA ACG CTC TAG ATG CCC CGG ACG 



Tyr Ser Ile Glu Pro Leu Asp Leu Pro Pro Ile 

8614 TAC TCC ATA GAA CCA CTT GAT CTA CCT CCA ATC 
ATG AGG TAT CTT GGT GAA CTA GAT GGA GGT TAG 



Ile Gln Arg Leu His Gly Leu Ser Ala Phe Ser 

8647 ATT CAA AGA CTC CAT GGC CTC AGC GCA TTT TCA 
TAA GTT TCT GAG GTA CCG GAG TCG CGT AAA AGT 



Leu His Ser Tyr Ser Pro Gly Glu Ile Asn Arg 

8680 CTC CAC AGT TAC TCT CCA GGT GAA ATT AAT AGG 
GAG GTG TCA ATG AGA GGT CCA CTT TAA TTA TCC 

FIG. 12-30 
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Val Ala Ala 

8713 GTG GCC GCA 
CAC CGG CGT 



Cys Leu Arg Lys 

TGC CTC AGA AAA 
ACG GAG TCT TTT 



Leu Gly Val Pro 

CTT GGG GTA CCG 
GAA CCC CAT GGC 



Pro Leu Arg Ala Trp Arg His Arg Ala Arg Ser 

8746 CCC TTG CGA GCT TGG AGA CAC CGG GCC CGG AGC 
GGG AAC GCT CGA ACC TCT GTG GCC CGG GCC TCG 

Val Arg Ala Arg Leu Leu Ala Arg Gly Gly Arg 
8779 GTC CGC GCT AGG CTT CTG GCC AGA GGA GGC AGG 
CAG GCG CGA TCC GAA GAC CGG TCT CCT CCG TCC 

Ala Ala Ile Cys Gly Lys Tyr Leu Phe Asn Trp 

8812 GCT GCC ATA TGT GGC AAG TAC CTC TTC AAC TGG 
CGA CGG TAT ACA CCG TTC ATG GAG AAG TTG ACC 



Ala Val Arg Thr Lys Leu Lys 
8845 GCA GTA AGA ACA AAG CTC AAA C 
CGT CAT TCT TGT TTC GAG TTT G 

FIG. 12-31 
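
Each sheet of FIG. 12 prints the lower strand directly beneath the upper strand, base for base, as its Watson-Crick complement. The following is a minimal sketch, not part of the original disclosure, of how a transcribed pair of rows could be checked for transcription errors. The function names `complement` and `strand_mismatches` are hypothetical and are introduced here only for illustration.

```python
# Hypothetical helper names; this is an illustrative sketch only.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def complement(upper):
    """Return the base-by-base complement of an upper-strand row.

    Whitespace between codons is ignored. The result is written in the
    same left-to-right order as the figure (it is not reversed).
    """
    bases = "".join(upper.split()).upper()
    return "".join(COMPLEMENT[base] for base in bases)


def strand_mismatches(upper, lower):
    """Return 0-based positions where `lower` is not the complement of `upper`."""
    expected = complement(upper)
    observed = "".join(lower.split()).upper()
    if len(expected) != len(observed):
        raise ValueError("rows have different lengths")
    return [i for i, (e, o) in enumerate(zip(expected, observed)) if e != o]


# Final row pair of FIG. 12-31 (position 8845).
upper_row = "GCA GTA AGA ACA AAG CTC AAA C"
lower_row = "CGT CAT TCT TGT TTC GAG TTT G"
print(strand_mismatches(upper_row, lower_row))
```

An empty list means the two rows are consistent; any reported position points at a base that was probably misread in one of the two rows.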
primer J159S 

Jl ACTGCCCTGAACTGCAATGA 
PT G 



Ser Leu Lys Thr Gly Phe Leu Ala Ala 

Jl 1 C TCC CTC AAA ACT GGG TTT CTT GCC GCG 

PT T AG C C C GG T G A G 

Asn Trp Gly 



Leu Phe Tyr Thr His Lys Phe Asn Ala 

J1 29 CTG TTC TAC ACA CAC AAG TTC AAC GCG 

PT T T CAC T T 

His Ser 
primer 166A for Jl-1216 
Gly Ile Val Pro Ala Ser Gln Val Cys 
Jl 191 GGT ATC GTA CCC GCG TCG CAG GTG TGC 

PT T G AA AGT T 

Lys Ser 

Gly Pro Val Tyr Cys Phe Thr Pro Ser 

Jl 218 GGT CCA GTG TAT TGC TTC ACC CCA AGC 

PT G A T C 

Pro Val Val Val Gly Thr Thr Asp Arg 

Jl 245 CCT GTT GTA GTG GGG ACG ACC GAT CGT 

PT CGG A CAG 



Phe Gly Ala Pro Thr Tyr Asn Trp Gly 

Jl 272 TTC GGC GCC CCT ACG TAT AAC TGG GGG 

PT CG G C C C G T 

Ser Ser 



Asp Asn Glu Thr Asp Val Leu Leu Leu 
Jl 299 GAC AAT GAG ACG GAC GTG CTG CTC CTA 

PT A T C T C G T 

Glu Asp Phe Val 



Asn Asn Thr Arg Pro Pro His Gly Asn 
Jl 326 AAC AAC ACG CGG CCC CCG CAC GGC AAC 

PT T C A A TG T 

Leu 



Trp Phe Gly Cys Thr 

Jl 353 TGG TTC GGC TGT ACA 

PT T CTGGATGAACTCAACTGGATT 

primer 199A 



Nucleotide Match: 259/367 (70.6%) 

Amino Acid Match (stringent): 93/122 (76.2%) 

(relaxed): 111/122 (91.0%) 
Only the prototype HCV (PT) residues that differ from 
the Japanese HCV (J1) sequence are shown. 

Relaxed amino acid match: Gly=Ala=Pro=Ser=Thr, 
Asp=Glu, Asn=Gln, 
Arg=Lys=His, Leu=Ile=Val=Met, Phe=Trp=Tyr. 
Underline: amino acid that differs even under relaxed 
matching. 

FIG. 13-3 
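
The stringent and relaxed match figures reported for FIG. 13 can be reproduced from two aligned residue lists. The sketch below is illustrative only and is not taken from the original disclosure: the equivalence groups are those given in the figure legend, while the function names (`relaxed_equal`, `match_counts`, `percent`) and the four-residue example lists are hypothetical.

```python
# Equivalence groups as given in the FIG. 13 legend.
RELAXED_GROUPS = [
    {"Gly", "Ala", "Pro", "Ser", "Thr"},
    {"Asp", "Glu"},
    {"Asn", "Gln"},
    {"Arg", "Lys", "His"},
    {"Leu", "Ile", "Val", "Met"},
    {"Phe", "Trp", "Tyr"},
]


def relaxed_equal(a, b):
    """True if two residues are identical or fall in the same legend group."""
    if a == b:
        return True
    return any(a in group and b in group for group in RELAXED_GROUPS)


def match_counts(seq_a, seq_b):
    """Return (stringent matches, relaxed matches, total) for aligned lists."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    stringent = sum(a == b for a, b in zip(seq_a, seq_b))
    relaxed = sum(relaxed_equal(a, b) for a, b in zip(seq_a, seq_b))
    return stringent, relaxed, len(seq_a)


def percent(matches, total):
    """Percentage rounded to one decimal place, as printed in the figure."""
    return round(100.0 * matches / total, 1)


# Hypothetical four-residue example (not sequences from the figure).
example_a = ["Val", "Asp", "Ser", "Cys"]
example_b = ["Ile", "Glu", "Ser", "Trp"]
print(match_counts(example_a, example_b))

# Ratios printed in FIG. 13.
print(percent(259, 367), percent(93, 122), percent(111, 122))
```

Under these assumptions a residue pair such as Val/Ile counts as a relaxed match but not a stringent one, which is why the relaxed figure is always at least as high as the stringent figure.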






WO 91/04262 






PCT/US90/05242 



Core to NS1 vs. HCV-1 
jl Pro Leu Val 

HCV-1 T CCG CTC GTC 

A 



Gly Ala Pro Leu Gly Gly Ala Ala Arg 

J1 11 GGC GCC CCC TTA GGG GGC GCT GCC AGG 



Ala Leu Ala His Gly Val Arg Val Leu 

J1 38 GCC CTG GCA CAT GGT GTC CGG GTT CTG 

G C 



Glu Asp Gly Val Asn Tyr Ala Thr Gly 

Jl 65 GAG GAC GGC GTG AAC TAT GCA ACA GGG 



Asn Leu Pro Gly Cys Ser Phe Ser Ile 

J1 92 AAT TTG CCC GGT TGC TCT TTC TCT ATC 






Phe Leu Leu Ala Leu Leu Ser Cys Leu 

Jl 119 TTC CTC TTG GCT CTG CTG TCC TGT TTG 

— T C — — C — T — C 



Thr Ile Pro Ala Ser Ala Tyr Glu Val 

Jl 146 ACC ATC CCA GCT TCC GCT TAT GAA GTG 

— T G-G — C — G — C — C C — 

Val Gln 



Arg Asn Val Ser Gly Ile Tyr His Val 

J1 173 CGC AAC GTG TCC GGG ATA TAC CAT GTC 

TCC A-G C-T — C 

Ser Thr Leu 



Thr Asn Asp Cys Ser Asn Ser Ser Ile 

Jl 200 ACA AAC GAC TGC TCC AAC TCA AGC ATT 

— C — T — T — C-T — — G — T 

Pro 

FIG. 14-1 
Val Tyr Glu Ala Ala Asp Val Ile Met 
J1 227 GTG TAT GAG GCG GCG GAC GTG ATC ATG 

— C — C — T -CC C — 

Ala Leu 



His Ala Pro Gly Cys Val Pro Cys Val 

Jl 254 CAT GCC CCC GGG TGC GTG CCC TGC GTT 

— C A-T ~ G C — T 

Thr 



Arg Glu Asn Asn Ser Ser Arg Cys Trp 

J1 281 CGG GAG AAC AAT TCC TCC CGT TGC TGG 

T GG- — C G — — G A-G — T 

Gly Ala 






Val Ala Leu Thr Pro Thr Leu Ala Ala 
J1 308 GTA GCG CTC ACT CCC ACG CTC GCG GCC 

__ A A-G — C — T G-G — C A~ 

Met Val Thr 



Arg Asn Ala Ser Val Pro Thr Thr Thr 

Jl 335 AGG AAT GCC AGC GTC CCC ACT ACG ACA 

G — -G- -AA C — G-G CAG 

Asp Gly Lys Leu Ala 

Gln 



Leu Arg Arg His Val Asp Leu Leu Val 

Jl 362 TTA CGA CGC CAC GTC GAC TTG CTC GTT 

C-T T A- T C T — C 



Gly Thr Ala Ala Phe Cys Ser Ala Met 

J1 389 GGG ACG GCT GCT TTC TGC TCC GCT ATG 

GC — C A-C C — — T — G — C C-C 

Ser Thr Leu Leu 



Tyr Val Gly Asp Leu Cys Gly Ser Val 

J1 416 TAC GTG GGG GAT CTC TGC GGA TCT GTT 

C — A — G — C 
Phe Leu Ile Ser Gln Leu Phe Thr Phe 

Jl 443 TTC CTC ATC TCC CAG CTG TTC ACC TTC 

— T — T G — . GG- — A 

Val Gly 



Ser Pro Arg Arg His Glu Thr Val Gln 

J1 470 TCG CCT CGC CGG CAT GAG ACA GTA CAG 

— T — C A-G — C — C TG- — G ACG — A 

Trp Thr 



Asp Cys Asn Cys Ser Ile Tyr Pro Gly 

J1 497 GAC TGC AAC TGC TCA ATC TAT CCC GGC 

-GT 

Gly 



His Val Ser Gly His Arg Met Ala Trp 

J1 524 CAC GTA TCA GGC CAT CGC ATG GCT TGG 

Ile Thr 



Asp Met Met Met Asn Trp Ser Pro Thr 

J1 551 GAT ATG ATG ATG AAC TGG TCG CCC ACG 



Ala Ala Leu Val Val Ser Gln Leu Leu 

J1 578 GCA GCC TTA GTG GTG TCG CAG TTA CTC 

A-G — G — G — A A — G-T C-G 

Thr Met Ala 



Arg Ile Pro Gln Ala Val Met Asp Met 

J1 605 CGG ATC CCA CAA GCT GTC ATG GAC ATG 

A — T — — C 

Ile Leu 



Val Ala Gly Ala His Trp Gly Val Leu 

J1 632 GTG GCG GGG GCC CAC TGG GGA GTC CTA 

A-C — T 

Ile 
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Ala Gly Leu Ala Tyr Tyr Ser Met Val 

Jl 659 GCG GGC CTT GCC TAC TAT TCC ATG GTG 

A-A — G — T -TC 

Ile Phe 



Gly Asn Trp Ala Lys Val Leu Ile Val 

Jl 686 GGG AAC TGG GCT AAG GTT TTG ATT GTG 

G — ' C C — G-A 

Val 



Met Leu Leu Phe Ala Gly Val Asp Gly 

713 ATG CTA CTC TTT GCC GGC GTT GAC GGG 

c G —A C C- 

Leu Ala 



His Thr Arg Val Thr Gly Gly Val Gln 

740 CAT ACC CGC GTG ACG GGG GGG GTG CAA 
G-A -A- — C — C — A AGT GCC 

Glu His Ser Ala 



Gly His Val Thr Ser Thr Leu Thr Ser 
767 GGC CAC GTC ACC TCT ACA CTC ACG TCC 
ACT GTG GGA T-T GTT AG- 

Thr Val Gly Phe Val 
Cys Asn Asp Ser Leu Gln Thr Gly Phe 

875 TGC AAT GAC TCC CTC CAA ACT GGG TTC 

T AG- — - A-C — C — C -GG 

Asn Trp 



Leu Ala Ala 

J1 902 CTT GCC GCG CTG TTC TAC ACA CAC AAG 

T-G — A -G- — T — T CAC C — 

Gly His 



J1 929 TTC AAC GCG TCC GGA TGC CCG GAG CGC 
T-T — A — C — T — T A-G 



Ser Ile Asp Lys 
J1 956 ATG GCC AGC TGT CGC TCC ATT GAC AAG 
C-A C —A C— C— AC- G-T 

Leu Pro Leu Thr Asp 



Phe Asp Gln Gly Trp Gly Pro Ile Thr 

Jl 983 TTC GAC CAG GGA TGG GGT CCC ATC ACC 

— T — c — C — T -GT 

Ser 



Tyr Ala Gln Pro Asp Asn Ser Asp Gln 

Jl 1010 TAT GCT CAA CCT GAC AAC TCG GAC CAG 

C AAC GGA AGC GG- C-C 

Asn Gly Ser Gly Pro 



Arg Pro Tyr Cys Trp His Tyr Ala Pro 
Jl 1037 AGG CCG TAT TGC TGG CAC TAC GCA CCT 

C-C — C — C C-C — A 

Pro 



Arg Gln Cys Gly Ile Val Pro Ala Ser 

Jl 1064 CGA CAG TGT GGT ATC GTA CCC GCG TCG 

AA- -CT — C — T — G AA- 

Lys Pro Lys 

FIG. 14-5 
Gln Val Cys Gly Pro Val Tyr Cys Phe 
Jl 1091 CAG GTG TGC GGT CCA GTG TAT TGC TTC 

AGT — T — G — A 



Thr Pro Ser Pro Val Val Val Gly Thr 
Jl 1118 ACC CCA AGC CCT GTT GTA GTG GGG ACG 

— T — c — c — G — G — A 



Thr Asp Arg Phe Gly Ala Pro Thr Tyr 

Jl 1145 ACC GAT CGT TTC GGC GCC CCT ACG TAT 

A-G -CG G — C — C — C 



Asn Trp Gly Asp Asn Glu Thr Asp Val 
Jl 1172 AAC TGG GGG GAC AAT GAG ACG GAC GTG 

-G — T — A — T — C 

Glu Asp 



Leu Leu Leu Asn Asn Thr Arg Pro Pro 

Jl 1199 CTG CTC CTA AAC AAC ACG CGG CCC CCG 

T-C G — — T — — T — C A — --A 

Phe Val 



His Gly Asn Trp Phe Gly Cys Thr 

Jl 1226 CAC GGC AAC TGG TTC GGC TGT ACA 

-TG — T — T — 

Leu 

FIG. 14-6 
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Gly Asn Trp Phe Gly Cys Thr Trp Met 

TG GGC AAC TGG TTC GGC TGT ACA TGG ATG 
— T — T — C 



Asn Ser Thr Gly Phe Thr Lys Thr Cys 

AAT AGC ACT GGG TTC ACC AAG ACG TGC 
— C TCA — A GT 

Val 



Gly Gly Pro Pro Cys Asn Ile Gly Gly 

GGA GGC CCC CCG TGT AAC ATC GGG GGG 
-CG — T — T GT- 

Val 



Val Gly Asn Asn Thr Leu Thr Cys Pro 

GTC GGC AAC AAC ACC TTG ACC TGC CCC 
Ala His 



Thr Asp Cys Phe Arg Lys Thr Pro Thr 

ACG GAC TGC TTC CGG AAG ACC CCG ACG 

T T — C CAT GAC 

His Asp 



Ala Thr Tyr Thr Lys Cys Gly Ser Gly 

GCC ACT TAC ACA AAA TGT GGT TCG GGC 
T-T CGG — C — C — C — T 

Ser Arg 



Pro Trp Leu Thr Pro Arg Cys Leu Val 

CCT TGG TTG ACA CCT AGG TGC TTG GTT 
— C A-C C — C C 

Ile 



Asp Tyr Pro Tyr Arg Leu Trp His Tyr 

GAC TAC CCA TAC AGG CTC TGG CAC TAC 
G ~ T T T — T 
Pro Cys Thr Val Asn Phe Thr Ile Phe 

J1 219 CCC TGC ACT GTC AAC TTT ACC ATC TTC 
HCV-1 — T — t — C A — -AC — A — T 

Tyr 



Lys Val Arg Met Tyr Val Gly Gly Val 

J1 246 AAG GTT AGG ATG TAT GTG GGG GGC GTG 
HCV-1 — A A-C C A — G ~C 



Glu His 

J1 273 GAG CAC 
HCV-1 — A 

C200 region sequence vs. HCV-1 



Asn Met Ser 

C200 3799 ATG TCC 
HCV-1 3781 ACA CTG GGC TTT GGT GCT T-C 

Thr Leu Gly Phe Gly Ala Tyr 



Lys Ala His Gly Thr Asp Pro Asn Ile 

C200 3808 AAG GCA CAT GGC ACC GAC CCC AAC ATC 
HCV-1 T g -T- — T — T 



Arg Thr Gly Val Arg Thr Ile Thr Thr 

C200 3835 AGA ACT GGG GTA AGG ACC ATC ACC ACA 
HCV-1 — G — C G —A —A — T T 



Gly Ala Pro Ile Thr Tyr Ser Thr Tyr 

C200 3862 GGT GCC CCC ATT ACG TAC TCC ACC TAT 
HCV-1 — C AG C — C 



Arg Lys Phe Leu Ala Asp Gly Gly Cys 

C200 3889 CGC AAG TTC CTT GCC GAC GGT GGT TGC 
HCV-1 G C — G 

Gly 



Ser Gly Gly Ala Tyr Asp Ile Ile 

C200 3916 TCC GGG GGC GCC TAT GAC ATC ATA A 
HCV-1 q — T — A -TT 

Ile 



HCV-1 3943 TGT GAC GAG TGC CAC TCC ACG GAT GCC 

Cys Asp Glu Cys His Ser Thr Asp Ala 



HCV-1 3970 ACA TCC ATC TTG GGC ATC GGC ACT GTC 

Thr Ser Ile Leu Gly Ile Gly Thr Val 

FIG. 16-1 
HCV-1 3997 CTT GAC CAA GCA GAG ACT GCG GGG GCG 

Leu Asp Gln Ala Glu Thr Ala Gly Ala 



HCV-1 4024 AGA CTG GTT GTG CTC GCC ACC GCC ACC 

Arg Leu Val Val Leu Ala Thr Ala Thr 



HCV-1 4051 CCT CCG GGC TCC GTC ACT GTG CCC CAT 

Pro Pro Gly Ser Val Thr Val Pro His 



HCV-1 4078 CCC AAC ATC GAG GAG GTT GCT CTG TCC 

Pro Asn Ile Glu Glu Val Ala Leu Ser 



HCV-1 4105 ACC ACC GGA GAG ATC CCT TTT TAC GGC 

Thr Thr Gly Glu Ile Pro Phe Tyr Gly 



Ser Ile Pro Ile Glu Ala Ile Lys 

C200 4132 A AGC ATC CCC ATC GAG GCC ATC AAG 
HCV-1 AAG GCT C A -TA 

Lys Ala Val 



Gly Gly Arg His Leu Ile Phe Cys His 

C200 4159 GGG GGA AGG CAT CTC ATC TTC TGC CAT 
HCV-1 G —A T 



Ser Lys Lys Lys Cys Asp Glu Leu Ala 

C200 4186 TCC AAG AAG AAG TGT GAC GAG CTC GCC 
HCV-1 —A C A 



Ala Lys Leu Ser Ala Leu Gly Leu Asn 

C200 4213 GCA AAG CTG TCA GCC CTC GGA CTC AAT 
HCV-1 GTC — A T-G — C A — 

Val 



Ala Val Ala Tyr Tyr Arg Gly Leu Asp 

C200 4240 GCC GTG GCG TAT TAC CGC GGT CTT GAT 
HCV-1 c — C C 

FIG. 16-2 
Val Ser Val Ile Pro Thr Ser Gly Asp 

C200 4267 GTG TCC GTC ATA CCA ACT AGC GGA GAC 

HCV-1 C — G ~ C C — T 



Val Val Val Val Ala Thr Asp 

C200 4294 GTC GTT GTC GTG GCA ACA GAC GC 4316 
HCV-1 — T — C — C — T — C CTC 



HCV-1 4321 ATG ACC GGC TAT ACC GGC GAC TTC GAC 

Met Thr Gly Tyr Thr Gly Asp Phe Asp 



HCV-1 4348 TCG GTG ATA GAC TGC AAT ACG TGT GTC 

Ser Val Ile Asp Cys Asn Thr Cys Val 



HCV-1 4375 ACC CAG ACA GTC GAT TTC AGC CTT GAC 

Thr Gln Thr Val Asp Phe Ser Leu Asp 



HCV-1 4402 CCT ACC TTC ACC ATT GAG ACA ATC ACG 

Pro Thr Phe Thr Ile Glu Thr Ile Thr 



HCV-1 4429 CTC CCC CAG GAT GCT GTC TCC CGC ACT 

Leu Pro Gln Asp Ala Val Ser Arg Thr 



HCV-1 4456 CAA CGT CGG GGC AGG ACT GGC AGG GGG 

Gln Arg Arg Gly Arg Thr Gly Arg Gly 



HCV-1 4483 AAG CCA GGC ATC TAC AGA TTT GTG GCA 

Lys Pro Gly Ile Tyr Arg Phe Val Ala 



HCV-1 4510 CCG GGG GAG CGC CCC TCC GGC ATG TTC 

Pro Gly Glu Arg Pro Ser Gly Met Phe 



HCV-1 4537 GAC TCG TCC GTC CTC TGT GAG TGC TAT 

Asp Ser Ser Val Leu Cys Glu Cys Tyr 

FIG. 16-3 
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HCV-1 4564 GAC GCA GGC TGT GCT TGG TAT GAG CTC 

Asp Ala Gly Cys Ala Trp Tyr Glu Leu 



HCV-1 4591 ACG CCC GCC GAG ACT ACA GTT AGG CTA 

Thr Pro Ala Glu Thr Thr Val Arg Leu 



HCV-1 4618 CGA GCG TAC ATG AAC ACC CCG GGG CTT 

Arg Ala Tyr Met Asn Thr Pro Gly Leu 



HCV-1 4645 CCC GTG TGC CAG GAC CAT CTT GAA TTT 

Pro Val Cys Gln Asp His Leu Glu Phe 



HCV-1 4672 TGG GAG GGC GTC TTT ACA GGC CTC ACT 

Trp Glu Gly Val Phe Thr Gly Leu Thr 



HCV-1 4699 CAT ATA GAT GCC CAC TTT CTA TCC CAG 

His Ile Asp Ala His Phe Leu Ser Gln 



HCV-1 4726 ACA AAG CAG AGT GGG GAG AAC CTT CCT 

Thr Lys Gln Ser Gly Glu Asn Leu Pro 



HCV-1 4753 TAC CTG GTA GCG TAC CAA GCC ACC GTG 

Tyr Leu Val Ala Tyr Gln Ala Thr Val 



HCV-1 4780 TGC GCT AGG GCT CAA GCC CCT CCC CCA 

Cys Ala Arg Ala Gln Ala Pro Pro Pro 



HCV-1 4807 TCG TGG GAC CAG ATG TGG AAG TGT TTG 

Ser Trp Asp Gln Met Trp Lys Cys Leu 



HCV-1 4834 ATT CGC CTC AAG CCC ACC CTC CAT GGG 

Ile Arg Leu Lys Pro Thr Leu His Gly 



HCV-1 4861 CCA ACA CCC CTG CTA TAC AGA CTG GGC 

Pro Thr Pro Leu Leu Tyr Arg Leu Gly 

FIG. 16-4 
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HCV-1 4888 GCT GTT CAG AAT GAA ATC ACC CTG ACG 

Ala Val Gln Asn Glu Ile Thr Leu Thr 



HCV-1 4915 CAC CCA GTC ACC AAA TAC ATC ATG ACA 

His Pro Val Thr Lys Tyr Ile Met Thr 



HCV-1 4942 TGC ATG TCG GCC GAC CTG GAG GTC GTC 

Cys Met Ser Ala Asp Leu Glu Val Val 



HCV-1 4969 ACG AGC ACC TGG GTG CTC GTT GGC GGC 
Thr Ser Thr Trp Val Leu Val Gly Gly 



HCV-1 4996 GTC CTG GCT GCT TTG GCC GCG TAT TGC 

Val Leu Ala Ala Leu Ala Ala Tyr Cys 

HCV-1 5023 CTG TCA ACA GGC TGC GTG GTC ATA GTG 

Leu Ser Thr Gly Cys Val Val Ile Val 



HCV-1 5050 GGC AGG GTC GTC TTG TCC GGG AAG CCG 

Gly Arg Val Val Leu Ser Gly Lys Pro 



C200 
HCV-1 



Glu Val Leu 
GAA GTC CTC 



5077 GCA ATC ATA CCT GAC AGG 

Ala Ile Ile Pro Asp Arg 



Tyr Arg Glu Phe Asp Glu Met Glu Glu 
C200 5104 TAC CGA GAG TTC GAT GAG ATG GAA GAG 
HCV-1 



Cys Ala Ser His Leu Pro Tyr Ile Glu 

C200 5131 TGC GCC TCA CAC CTC CCC TAC ATC GAA 

HCV-1 T-T CAG T-A — G — G 

Ser Gln 

FIG. 16-5 
Gln Gly Met Gln Leu Ala Glu Gln Phe 

5158 CAG GGA ATG CAG CTC GCC GAG CAA TTC 

— A — G AT — G 

Met 



Lys Gln Lys Ala Leu Gly Leu Leu Gln 
5185 AAG CAG AAG GCG CTC GGG TTG CTG CAA 
C C C-C — G 



Thr Ala Thr Lys Gln Ala Glu Ala Ala 
5212 ACA GCC ACC AAG CAA GCG GAG GCT GCT 
— C — G T — CGT — G — A -T- ATC 

Ser Arg Val Ile 



Ala Pro Cys Glu Ser Met His Ala Ser 

5239 GCT CCG TGT GAG TCA ATG CAC GCC TCG 

— C — T GC- -TC CAG -CC A — TGG CAA 

Ala Val Gln Thr Asn Trp Gln 



5266 A 

-AA CTC GAG ACC TTC TGG GCG AAG CAT 

Lys Leu Glu Thr Phe Trp Ala Lys His 



5293 ATG TGG AAC TTC ATC AGT GGG ATA CAA TA 

Met Trp Asn Phe Ile Ser Gly Ile Gln 

FIG. 16-6 
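
In these figures each amino-acid row is the translation, in the standard genetic code, of the codon row printed with it. The sketch below is illustrative only and is not part of the original disclosure. It shows one way a transcribed codon row could be re-translated to check the residue labels; the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")

ONE_TO_THREE = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Ter",
}

# Map each of the 64 codons to a three-letter residue name.
CODON_TABLE = {
    "".join(codon): ONE_TO_THREE[aa]
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(row):
    """Translate a codon row (whitespace ignored) into three-letter residues.

    Trailing bases that do not complete a codon are skipped.
    """
    bases = "".join(row.split()).upper()
    usable = len(bases) - len(bases) % 3
    return [CODON_TABLE[bases[i:i + 3]] for i in range(0, usable, 3)]


# HCV-1 row at position 5293 of FIG. 16-6 (the trailing "TA" is incomplete).
print(" ".join(translate("ATG TGG AAC TTC ATC AGT GGG ATA CAA TA")))
```

Comparing the output with the printed residue row flags places where either the codons or the residue names were misread.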
NS1 Sequence vs. HCV-1 

Leu Gly Asn Trp Phe Gly Cys Thr Trp 

J1 1 G TTG GGC AAT TGG TTC GGT TGC ACC TGG 

HCV-1 ~ c T 



Met Asn Ser Ser Gly Phe Thr Lys Val 

J1 29 ATG AAC TCA TCT GGA TTT ACC AAA GTG 

HCV-1 — A C 

Thr 



Cys Gly Ala Pro Pro Cys Val Ile Gly 

J1 56 TGC GGA GCG CCT CCT TGT GTC ATC GGA 

HCV-1 

Ala 



Gly Val Gly Asn Asn Thr Leu Gln Cys 

J1 83 GGG GTG GGC AAC AAC ACC TTG CAA TGC 

HCV-1 

Ala His 



Pro Thr Asp Cys Phe Arg Lys His Pro 

J1 110 CCC ACT GAC TGT TTC CGC AAG CAT CCG 

HCV-1 



Asp Ala Thr Tyr Ser Arg Cys Gly Ser 

J1 137 GAC GCC ACA TAC TCT CGG TGC GGT TCC 

HCV-1 



Gly Pro Trp Ile Thr Pro Arg Cys Leu 

J1 164 GGT CCC TGG ATT ACG CCC AGG TGC CTG 
HCV-1 c — A 



Val His Tyr Pro Tyr Arg Leu Trp His 

J1 191 GTC CAC TAC CCT TAT AGG CTT TGG CAT 

HCV-1 G G 

Asp 



Tyr Pro Cys Thr Val Asn Tyr Thr Leu 

J1 218 TAT CCC TGT ACT GTC AAC TAC ACC TTG 

HCV-1 T — C A — A-A 

Ile Ile 
Phe Lys Val Arg Met Tyr Val Gly Gly 

J1 245 TTC AAA GTC AGG ATG TAC GTG GGA GGG 
HCV-1 — T A — 

Ile 



Val Glu His Arg Leu Glu Val Ala Cys 

J1 272 GTC GAG CAC AGG CTG GAA GTT GCT TGC 
HCV-1 A C C 

Ala 



Asn Trp Thr Arg Gly Glu Arg Cys Asp 

J1 299 AAC TGG ACG CGG GGC GAG CGT TGT GAT 
HCV-1 A C 



Leu Asp Asp Arg Asp 

J1 326 CTG GAC GAC AGG GAC A 
HCV-1 — A 

Glu 

FIG. 17-2 

Core Sequence vs. HCV-1 

J1 1 GCGTCTAGCCATGGCGTTAGTATGAGTGTC 
HCV-1 

J1 31 GTGCAGCCTCCAGGACCCCCCCTCCCGGGAGAGCC 

HCV-1 

J1 66 ATAGTGGTCTGCGGAACCGGTGAGTACACCGGAAT 
HCV-1 

J1 101 TGCCAGGACGACCGGGTCCTTTCTTGGATCAACCC 
HCV-1 

Jl 136 GCTCAATGCCTGGAGATTTGGGCGTGCCCCCGCGA 
HCV-1 ' A ~ 

Jl 171 GACTGCTAGCCGAGTAGTGTTGGGTCGCGAAAGGC 
HCV-1 

Jl 206 CTTGTGGTACTGCCTGATAGGGTGCTTGCGAGTGC 
HCV-1 T 

Met Ser 

Jl 241 CCCGGGAGGTCTCGTAGACCGTGCATCATG AGC 

Thr Asn Pro Lys Pro Gln Arg Lys Thr 
J1 274 ACA AAT CCT AAA CCT CAA AGA AAA ACC 
HCV-1 — G A- -A- 

Lys Asn 

Lys Arg Asn Thr Asn Arg Arg Pro Gln 

Jl 301 AAA CGT AAC ACC AAC CGC CGC CCA CAG 
HCV-1 T 

Asp Val Lys Phe Pro Gly Gly Gly Gln 

Jl 328 GAC GTC AAG TTC CCG GGC GGT GGT CAG 
HCV-1 T — C 

Ile Val Gly Gly Val Tyr Leu Leu Pro 

J1 355 ATC GTT GGT GGA GTT TAC CTG TTG CCG 

HCV-1 T 

Arg Arg Gly Pro Arg Leu Gly Val Arg 

Jl 382 CGC AGG GGC CCC AGG TTG GGT GTG CGC 
HCV-1 T — A 
FIG. 18-1 




Ala Thr Arg Lys Thr Ser Glu Arg Ser 

J1 409 GCG ACT AGG AAG ACT TCC GAG CGG TCG 
HCV-1 

Gln Pro Arg Gly Arg Arg Gln Pro Ile 

J1 436 CAA CCT CGT GGA AGG CGA CAA CCT ATC 
HCV-1 

Pro Lys Ala Arg Gln Pro Glu Gly Arg 

J1 463 CCC AAG GCT CGC CAG CCC GAG GGC AGG 
HCV-1 

Arg 

Ala Trp Ala Gln Pro Gly Tyr Pro Trp 

J1 490 GCC TGG GCT CAG CCC GGG TAC CCT TGG 
HCV-1 

Thr 

Pro Leu Tyr Gly Asn Glu Gly Met Gly 

J1 517 CCC CTC TAT GGC AAC GAG GGC ATG GGG 
HCV-1 

Cys 

Trp Ala Gly Trp Leu 

J1 544 TGG GCA GGA TGG CTC CT 
HCV-1 


FIG. 18-2 
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